Serverless computing has been 'the future' since AWS Lambda launched in 2014. Nearly a decade later, the adoption pattern is clear: serverless solves specific problems very well and others poorly. The key is knowing which is which.

Where serverless excels

Event-driven processing is serverless's home territory. Processing images after upload, sending emails triggered by user actions, running batch jobs on a schedule, and handling webhook payloads are all cases where the serverless execution model (pay per invocation, scale to zero) aligns with the workload shape. You pay for the compute you use, not for idle capacity. For irregular or unpredictable workloads, the cost savings over always-on containers are significant.
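A minimal sketch of what this looks like in practice, assuming an AWS Lambda handler triggered by S3 upload notifications; `process_image` is a hypothetical placeholder for the real work (fetching and resizing the object):

```python
import json

def process_image(bucket: str, key: str) -> str:
    # Hypothetical placeholder: a real handler would download the
    # object from S3 and run the actual image transformation here.
    return f"processed s3://{bucket}/{key}"

def handler(event, context):
    # The event shape follows the S3 notification format: each upload
    # arrives as a record carrying the bucket name and object key.
    results = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        key = record["s3"]["object"]["key"]
        results.append(process_image(bucket, key))
    return {"statusCode": 200, "body": json.dumps(results)}
```

The function runs only when an upload occurs and the platform scales it to zero in between, which is exactly the pay-per-invocation alignment described above.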

The cold start problem

Cold starts remain the most cited limitation of serverless. When a function has not been invoked recently, the runtime needs to initialise a new execution environment. For AWS Lambda running .NET or Java, cold starts can reach 1-3 seconds. For latency-sensitive user-facing APIs, this is unacceptable. Common mitigations include provisioned concurrency (keeping warm instances ready), lighter runtimes (Node.js and Python start faster than .NET or Java), and native AOT compilation for .NET Lambda, which can reduce cold starts to under 100ms.
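Beyond platform-level mitigations, code structure matters too: work placed at module scope runs once per execution environment during the cold start, while warm invocations reuse it. A minimal sketch, with `expensive_init` standing in for real setup such as SDK client construction or config loading:

```python
import time

def expensive_init():
    # Stand-in for expensive setup (SDK clients, connection pools,
    # parsing configuration). In a real function this might take
    # hundreds of milliseconds.
    time.sleep(0.05)
    return {"client": "ready"}

# Module scope: executed once per execution environment, i.e. paid
# during the cold start only.
CLIENT = expensive_init()

def handler(event, context):
    # Warm invocations reuse CLIENT and skip the init cost entirely.
    return {"initialized": CLIENT["client"], "echo": event}
```

Doing this initialisation inside the handler instead would re-pay the cost on every invocation, warm or cold.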

Observability challenges

Serverless functions are ephemeral and distributed. A single user request might invoke dozens of functions across a call chain. Correlating logs, traces, and metrics across Lambda invocations requires deliberate instrumentation. OpenTelemetry with AWS X-Ray or third-party providers like Datadog is the current standard. Teams that skip distributed tracing in serverless architectures spend disproportionate time debugging production issues that are trivial to diagnose with traces.
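The core idea behind that instrumentation, stripped of any tracing library, is propagating a correlation identifier through the call chain so log lines from different invocations can be joined. A stdlib-only sketch of the pattern (OpenTelemetry automates this with trace context; the field name `correlation_id` here is an illustrative assumption):

```python
import json
import logging
import uuid

logger = logging.getLogger("app")

def handler(event, context):
    # Reuse an upstream correlation id if one was passed in, otherwise
    # mint a new one at the edge of the call chain.
    corr_id = event.get("correlation_id") or str(uuid.uuid4())

    # Attach the id to every structured log line so the log backend
    # can stitch together one user request across many invocations.
    logger.info(json.dumps({"correlation_id": corr_id, "msg": "start"}))

    # Forward the id when invoking the next function downstream.
    return {"correlation_id": corr_id, "payload": event.get("payload")}
```

Real tracing systems carry this context in standard headers and record timing spans as well, but the discipline is the same: the identifier must be threaded through every hop deliberately.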

The right decision model

The question is not serverless versus containers; it is which workloads benefit from each model. Event-driven processing and APIs with irregular traffic: serverless. Long-running processes, latency-sensitive APIs with consistent traffic, and ML inference: containers. Most production architectures use both. The engineering mistake is forcing one model on all workloads for the sake of consistency rather than matching the infrastructure model to the workload characteristics.
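The rule of thumb above can be encoded as a toy decision helper; the inputs and branching are illustrative assumptions, not a formal model:

```python
def pick_model(event_driven: bool, traffic_is_steady: bool,
               long_running: bool, latency_sensitive: bool) -> str:
    """Sketch of the workload-matching heuristic described above."""
    # Long-running processes and latency-sensitive APIs with steady
    # traffic map to containers.
    if long_running or (latency_sensitive and traffic_is_steady):
        return "containers"
    # Event-driven work and irregular traffic map to serverless.
    if event_driven or not traffic_is_steady:
        return "serverless"
    # Steady, conventional workloads default to containers.
    return "containers"
```

The point is not the function itself but that the decision has explicit inputs: teams that can name these characteristics per workload avoid the one-model-for-everything mistake.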