Worker Services
A worker service is a continuously running container that is not directly accessible from outside your stack. It's ideal for background jobs, such as processing items from a message queue or handling other asynchronous tasks.
Key features include:
- Automatic scaling: Scales based on CPU or memory usage.
- Zero-downtime deployments: New versions are deployed without interrupting the service.
- Flexible container images: Supports various methods for providing a container image, including auto-packaging for popular languages.
- Fully managed: No need to manage servers, operating systems, or virtual machines.
- Seamless connectivity: Easily connects to other resources within your stack.
How it works
Stacktape uses AWS Elastic Container Service (ECS) to run your containers on either Fargate or EC2 instances.
- Fargate is a serverless compute engine that runs containers without requiring you to manage the underlying servers.
- EC2 instances are virtual servers that give you more control over the computing environment.
ECS services are self-healing, automatically replacing any container that fails. They also scale automatically based on the rules you define.
When to use it
This table helps you choose the right container-based resource for your needs:
| Resource type | Description | Use-cases |
|---|---|---|
| web-service | A container with a public endpoint and URL. | Public APIs, websites |
| private-service | A container with a private endpoint, accessible only within your stack. | Private APIs, internal services |
| worker-service | A container that runs continuously but is not directly accessible. | Background processing, message queue consumers |
| multi-container-workload | A customizable workload with multiple containers, where you define the accessibility of each one. | Complex, multi-component services |
| batch-job | A container that runs a single job and then terminates. | One-off or scheduled data processing tasks |
Advantages
- Control over the environment: Runs any Docker image or an image built from a Dockerfile.
- Cost-effective for predictable loads: Cheaper than Lambda functions for services with steady traffic.
- Load-balanced and scalable: Automatically scales horizontally based on CPU and memory usage.
- Highly available: Runs across multiple Availability Zones to ensure resilience.
- Secure by default: The underlying environment is managed and secured by AWS.
Disadvantages
- Slower scaling: Adding new container instances can take several seconds to a few minutes, which is slower than the nearly-instant scaling of Lambda functions.
- Not fully serverless: Cannot scale down to zero. You pay for at least one running instance (starting at ~$8/month), even if it's idle.
Basic usage
Here's a basic example of a worker service configuration:
resources:myWorkerService:type: worker-serviceproperties:packaging:type: stacktape-image-buildpackproperties:entryfilePath: src/main.tsresources:cpu: 2memory: 2048
Example worker service configuration.
And here's the corresponding application code:
import myContinuouslyRunningApp from './my-app';const app = myContinuouslyRunningApp();app.run();
Example worker container in TypeScript (main.ts).
Image
A worker service runs a Docker image. You can provide this image in four ways:
- stacktape-image-buildpack: Automatically packages your code without needing a Dockerfile.
- external-buildpack: Uses external buildpacks to create an image.
- custom-dockerfile: Builds an image from your own Dockerfile.
- prebuilt-images: Uses an existing image from a container registry.
Environment variables
A list of environment variables to pass to the script or command.
Values can be:
- A static string, number, or boolean.
- The result of a custom directive.
- A reference to another resource's parameter using the `$ResourceParam` directive.
- A value from a secret using the `$Secret` directive.
resources:myWorkerService:type: worker-serviceproperties:packaging:type: stacktape-image-buildpackproperties:entryfilePath: src/main.tsenvironment:- name: STATIC_ENV_VARvalue: my-env-var- name: DYNAMICALLY_SET_ENV_VARvalue: $MyCustomDirective('input-for-my-directive')- name: DB_HOSTvalue: $ResourceParam('myDatabase', 'host')- name: DB_PASSWORDvalue: $Secret('dbSecret.password')resources:cpu: 2memory: 2048
Health check
Health checks monitor your container to ensure it's running correctly. If a container fails its health check, it's automatically terminated and replaced with a new one.
For example, this health check uses curl to send a request to the service every 20 seconds. If the request fails or takes longer than 5 seconds, the check is considered failed.
resources:myWorkerService:type: worker-serviceproperties:packaging:type: stacktape-image-buildpackproperties:entryfilePath: src/index.tsinternalHealthCheck:healthCheckCommand: ['CMD-SHELL', 'curl -f http://localhost/ || exit 1']intervalSeconds: 20timeoutSeconds: 5startPeriodSeconds: 150retries: 2resources:cpu: 2memory: 2048
Shutdown
When a service instance is shut down (for example, during a deployment or when the stack is deleted), all of its containers receive a SIGTERM signal. This gives your application a chance to shut down gracefully.
By default, the application has 2 seconds to clean up before it's forcefully stopped with a SIGKILL signal. You can change this with the stopTimeout property (from 2 to 120 seconds).
process.on('SIGTERM', () => {console.info('Received SIGTERM signal. Cleaning up and exiting process...');// Finish any outstanding requests, or close a database connection...process.exit(0);});
Example of a cleanup function that runs before the container shuts down.
Logging
Anything your application writes to stdout or stderr is captured and stored in AWS CloudWatch.
You can view logs in a few ways:
- Stacktape Console: Find a direct link to the logs in the Stacktape Console.
- Stacktape CLI: Use the
stacktape logscommand to stream logs to your terminal. - AWS Console: Browse logs directly in the AWS CloudWatch console. The
stacktape stack-infocommand can provide a link.
Log storage can be expensive. To manage costs, you can configure retentionDays to automatically delete logs after a certain period.
Forwarding logs
You can forward logs to third-party services. See Forwarding Logs for more information.
Compute resources
In the resources section, you configure the CPU, memory, and instance types for your service. You can run your containers using either Fargate or EC2 instances.
- Fargate is a serverless option that lets you run containers without managing servers. You only need to specify the
cpuandmemoryyour service requires. It's a good choice for applications that need to meet high security standards like PCI DSS Level 1 and SOC 2. - EC2 instances are virtual servers that give you more control. You choose the instance types that best fit your needs, and ECS places your containers on them.
Regardless of whether you use Fargate or EC2 instances, your containers run securely within a VPC.
Configures the CPU, memory, and underlying compute engine for the service container.
You can choose between two compute engines:
- Fargate: A serverless engine that abstracts away server management. To use Fargate, specify
cpuandmemorywithoutinstanceTypes. - EC2: Provides direct control over the underlying virtual servers. To use EC2, specify the desired
instanceTypes.
Using Fargate
To use Fargate, specify cpu and memory in the resources section without including instanceTypes.
resources:myWorkerService:type: worker-serviceproperties:packaging:type: stacktape-image-buildpackproperties:entryfilePath: src/index.tsresources:cpu: 0.25memory: 512
Example of a service running on Fargate.
Using EC2 instances
To use EC2 instances, specify a list of instanceTypes in the resources section.
Instances are automatically added or removed to meet scaling demands.
Recommendation: For optimal resource utilization, specify a single instance type and omit the cpu and memory properties. Stacktape will then size the containers to fit the instance perfectly.
The order of instance types matters; the first in the list is preferred. For a full list of instance types, see the AWS EC2 instance types documentation.
Instances are automatically refreshed weekly to ensure they are patched and up-to-date. Your workload remains available during this process.
resources:myWebService:type: worker-serviceproperties:packaging:type: stacktape-image-buildpackproperties:entryfilePath: src/index.tsresources:instanceTypes:- c5.large
Example of a service running on EC2 instances.
Container placement on EC2
Stacktape tries to use your EC2 instances as efficiently as possible.
- If you specify
instanceTypeswithoutcpuandmemory, Stacktape configures each service instance to use the full resources of one EC2 instance. When the service scales out, a new EC2 instance is added for each new service instance. - If you specify
cpuandmemory, AWS will place multiple service instances on a single EC2 instance if there's enough capacity, maximizing utilization.
Default CPU and memory for EC2
- If
cpuis not specified, containers on an EC2 instance share its CPU capacity. - If
memoryis not specified, Stacktape sets the memory to the maximum amount available on the smallest instance type in yourinstanceTypeslist.
Using a warm pool
A warm pool keeps pre-initialized EC2 instances in a stopped state, allowing your service to scale out much faster. This is useful for handling sudden traffic spikes. You only pay for the storage of stopped instances, not for compute time.
To enable it, set enableWarmPool to true. This feature is only available when you specify exactly one instance type.
For more details, see the AWS Auto Scaling warm pools documentation.
resources:myWebService:type: web-serviceproperties:packaging:type: stacktape-image-buildpackproperties:entryfilePath: src/index.tsresources:instanceTypes:- c5.largeenableWarmPool: true
Scaling
The scaling section lets you control how your service scales. You can set the minimum and maximum number of running instances and define a policy that triggers scaling actions.
Scaling policy
A scaling policy defines the CPU and memory thresholds that trigger scaling.
- Scaling out (adding instances): The service scales out if either the average CPU or memory utilization exceeds the target you set.
- Scaling in (removing instances): The service scales in only when both CPU and memory utilization are below their target values.
The scaling process is more aggressive when adding capacity than when removing it. This helps ensure your application can handle sudden increases in load, while scaling in more cautiously to prevent flapping (scaling in and out too frequently).
resources:myWorkerService:type: worker-serviceproperties:packaging:type: stacktape-image-buildpackproperties:entryfilePath: src/index.tsresources:cpu: 0.5memory: 1024scaling:minInstances: 1maxInstances: 5scalingPolicy:keepAvgMemoryUtilizationUnder: 80keepAvgCpuUtilizationUnder: 80
Example of a scaling configuration.
Storage
Each service instance has its own temporary, or ephemeral storage, with a fixed size of 20GB. This storage is deleted when the instance is removed. Different instances of the same service do not share their storage.
For persistent data storage, use Buckets.
Accessing other resources
By default, AWS resources cannot communicate with each other. Access must be granted using IAM permissions.
Stacktape automatically configures the necessary permissions for the services it manages. For example, it allows a worker service to write logs to CloudWatch.
However, if your application needs to access other resources, you must grant permissions manually. You can do this in two ways:
Using connectTo
The connectTo property lets you grant access to other Stacktape-managed resources by simply listing their names. Stacktape automatically configures the required IAM permissions and injects connection details as environment variables into your service.
resources:photosBucket:type: bucketmyWorkerService:type: worker-serviceproperties:packaging:type: stacktape-image-buildpackproperties:entryfilePath: src/index.tsconnectTo:# access to the bucket- photosBucket# access to AWS SES- aws:sesresources:cpu: 0.25memory: 512
Configures access to other resources in your stack and AWS services. By specifying resources here, Stacktape automatically:
- Configures IAM role permissions.
- Sets up security group rules to allow network traffic.
- Injects environment variables with connection details into the compute resource.
Environment variables are named STP_[RESOURCE_NAME]_[VARIABLE_NAME] (e.g., STP_MY_DATABASE_CONNECTION_STRING).
Using iamRoleStatements
For more granular control, you can provide a list of raw IAM role statements. These statements are added to the service's IAM role, allowing you to define precise permissions for any AWS resource.
resources:myWorkerService:type: worker-serviceproperties:packaging:type: stacktape-image-buildpackproperties:entryfilePath: server/index.tsiamRoleStatements:- Resource:- $CfResourceParam('NotificationTopic', 'Arn')Effect: 'Allow'Action:- 'sns:Publish'resources:cpu: 2memory: 2048cloudformationResources:NotificationTopic:Type: 'AWS::SNS::Topic'
Default VPC connection
Some AWS services, like relational databases, must be deployed within a VPC. If your stack includes such resources, Stacktape automatically creates a default VPC and connects them to it.
Worker services are connected to this default VPC by default, allowing them to communicate with other VPC-based resources without extra configuration.
To learn more, see the documentation on VPCs and resource accessibility.
Pricing
When using Fargate, you are charged for:
- vCPU per hour: ~$0.04 - $0.07, depending on the region.
- Memory (GB) per hour: ~$0.004 - $0.008, depending on the region.
Usage is billed by the second, with a one-minute minimum. For more details, see AWS Fargate pricing.