Attribute-based instance type selection for Spot Fleet
When you create a Spot Fleet, you must specify one or more instance types for configuring the On-Demand Instances and Spot Instances in the fleet. As an alternative to manually specifying the instance types, you can specify the attributes that an instance must have, and Amazon EC2 will identify all the instance types with those attributes. This is known as attribute-based instance type selection. For example, you can specify the minimum and maximum number of vCPUs required for your instances, and Spot Fleet will launch the instances using any available instance types that meet those vCPU requirements.
Attribute-based instance type selection is ideal for workloads and frameworks that can be flexible about what instance types they use, such as when running containers or web fleets, processing big data, and implementing continuous integration and deployment (CI/CD) tooling.
Benefits
Attribute-based instance type selection has the following benefits:
-
With so many instance types available, finding the right instance types for your workload can be time consuming. When you specify instance attributes, the instance types will automatically have the required attributes for your workload.
-
To manually specify multiple instance types for a Spot Fleet, you must create a separate launch template override for each instance type. But with attribute-based instance type selection, to provide multiple instance types, you need only specify the instance attributes in the launch template or in a launch template override.
-
When you specify instance attributes rather than instance types, your fleet can use newer generation instance types as they’re released, "future proofing" the fleet's configuration.
-
When you specify instance attributes rather than instance types, Spot Fleet can select from a wide range of instance types for launching Spot Instances, which adheres to the Spot best practice of instance type flexibility.
How attribute-based instance type selection works
To use attribute-based instance type selection in your fleet configuration, you replace the list of instance types with a list of instance attributes that your instances require. Spot Fleet will launch instances on any available instance types that have the specified instance attributes.
Types of instance attributes
There are several instance attributes that you can specify to express your compute requirements. For a description of each attribute and the default values, see InstanceRequirements in the Amazon EC2 API Reference.
Where to configure attribute-based instance type selection
Depending on whether you use the console or the Amazon CLI, you can specify the instance attributes for attribute-based instance type selection as follows:
In the console, you can specify the instance attributes in one or both of the following fleet configuration components:
-
In a launch template, and then reference the launch template in the fleet request
-
In the fleet request
In the Amazon CLI, you can specify the instance attributes in one or all of the following fleet configuration components:
-
In a launch template, and reference the launch template in the fleet request
-
In a launch template override
If you want a mix of instances that use different AMIs, you can specify instance attributes in multiple launch template overrides. For example, different instance types can use x86 and Arm-based processors.
-
In a launch specification
How Spot Fleet uses attribute-based instance type selection when provisioning a fleet
Spot Fleet provisions a fleet in the following way:
-
Spot Fleet identifies the instance types that have the specified attributes.
-
Spot Fleet uses price protection to determine which instance types to exclude.
-
Spot Fleet determines the capacity pools from which it will consider launching the instances based on the Amazon Regions or Availability Zones that have matching instance types.
-
Spot Fleet applies the specified allocation strategy to determine from which capacity pools to launch the instances.
Note that attribute-based instance type selection does not pick the capacity pools from which to provision the fleet; that's the job of the allocation strategies. There might be a large number of instance types with the specified attributes, and some of them might be expensive. The default allocation strategy of lowestPrice for Spot and On-Demand guarantees that Spot Fleet will launch instances from the least expensive capacity pools.
If you specify an allocation strategy, Spot Fleet will launch instances according to the specified allocation strategy.
-
For Spot Instances, attribute-based instance type selection supports the capacityOptimizedPrioritized, capacityOptimized, and lowestPrice allocation strategies.
-
For On-Demand Instances, attribute-based instance type selection supports the lowestPrice allocation strategy.
-
If there is no capacity for the instance types with the specified instance attributes, no instances can be launched, and the fleet returns an error.
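The provisioning steps above can be sketched as a small filter pipeline. This is only an illustrative model, not how Spot Fleet is implemented: the `Pool` class, the `choose_pool` function, and all prices and zone names are hypothetical, and the allocation step assumes a lowestPrice-style strategy.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pool:
    """A capacity pool: one instance type in one Availability Zone (made-up data)."""
    instance_type: str
    zone: str
    vcpus: int
    memory_mib: int
    price: float        # hypothetical hourly price
    has_capacity: bool


def choose_pool(pools, min_vcpus, max_vcpus, min_memory_mib, price_ceiling, zones):
    # Step 1: identify instance types that have the specified attributes.
    matching = [p for p in pools
                if min_vcpus <= p.vcpus <= max_vcpus and p.memory_mib >= min_memory_mib]
    # Step 2: price protection excludes types priced above the ceiling.
    affordable = [p for p in matching if p.price <= price_ceiling]
    # Step 3: only pools in the requested Availability Zones are considered.
    candidates = [p for p in affordable if p.zone in zones]
    # Step 5: with no capacity in any candidate pool, nothing can be launched.
    available = [p for p in candidates if p.has_capacity]
    if not available:
        raise RuntimeError("no capacity for instance types with the specified attributes")
    # Step 4: a lowestPrice-style strategy picks the least expensive pool.
    return min(available, key=lambda p: p.price)
```

Keeping the attribute filter, price protection, and allocation strategy as separate stages mirrors the point made above: the attributes only decide which instance types are eligible, while the allocation strategy decides which pool is used.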
Price protection
Price protection is a feature that prevents your Spot Fleet from using instance types that you would consider too expensive even if they happen to fit the attributes that you specified. When you create a fleet with attribute-based instance type selection, price protection is enabled by default, with separate thresholds for On-Demand Instances and Spot Instances. When Amazon EC2 selects instance types with your attributes, it excludes instance types priced above your threshold. The thresholds represent the maximum you'll pay, expressed as a percentage above the least expensive current generation M, C, or R instance type with your specified attributes.
If you don't specify a threshold, the following thresholds are used by default:
-
For On-Demand Instances, the price protection threshold is set at 20 percent.
-
For Spot Instances, the price protection threshold is set at 100 percent.
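As a rough illustration of how a percentage threshold translates into a price ceiling, the following sketch applies the default thresholds to made-up prices. The function names, instance type names, and prices are hypothetical, and the reference price (the least expensive matching current generation M, C, or R instance type) is simply passed in by the caller rather than looked up.

```python
# Default thresholds stated in this section (percent above the reference price).
DEFAULT_ON_DEMAND_THRESHOLD = 20
DEFAULT_SPOT_THRESHOLD = 100


def price_ceiling(reference_price, threshold_pct):
    """Highest price allowed: the reference price plus the given percentage."""
    return reference_price * (1 + threshold_pct / 100)


def apply_price_protection(prices, reference_price, threshold_pct):
    """Keep only the instance types priced at or below the ceiling."""
    ceiling = price_ceiling(reference_price, threshold_pct)
    return {name: price for name, price in prices.items() if price <= ceiling}


# Hypothetical hourly prices for three instance types that match the attributes.
prices = {"type-a": 0.10, "type-b": 0.11, "type-c": 0.50}
print(sorted(apply_price_protection(prices, 0.10, DEFAULT_ON_DEMAND_THRESHOLD)))
print(sorted(apply_price_protection(prices, 0.10, DEFAULT_SPOT_THRESHOLD)))
```

With these made-up numbers, the type priced at five times the reference price is dropped under both thresholds, even though it has the requested attributes.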
To specify the price protection threshold
While creating the Spot Fleet, configure the fleet for attribute-based instance type selection, and then do the following:
-
Console
To specify the On-Demand Instance price protection threshold, under Additional instance attribute, choose On-demand price protection, and then choose Add attribute. For On-Demand price protection percentage, enter the price protection threshold as a percentage.
To specify the Spot Instance price protection threshold, under Additional instance attribute, choose Spot price protection, and then choose Add attribute. For Spot price protection percentage, enter the price protection threshold as a percentage.
-
Amazon CLI
To specify the On-Demand Instance price protection threshold, in the JSON configuration file, in the InstanceRequirements structure, for OnDemandMaxPricePercentageOverLowestPrice, enter the price protection threshold as a percentage.
To specify the Spot Instance price protection threshold, in the JSON configuration file, in the InstanceRequirements structure, for SpotMaxPricePercentageOverLowestPrice, enter the price protection threshold as a percentage.
For more information about creating the fleet, see Create a Spot Fleet with attribute-based instance type selection.
Note
When creating the Spot Fleet, if you set Total target capacity type
to vCPUs or Memory (MiB) (console) or
TargetCapacityUnitType to vcpu or
memory-mib (Amazon CLI), the price protection threshold is applied
based on the per-vCPU or per-memory price instead of the per-instance
price.
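To see why the unit type matters, the sketch below compares two hypothetical instance types per instance and per vCPU. The function name and all figures are assumptions for illustration; only the vcpu and memory-mib values come from the note above, and units is assumed to be the per-instance default.

```python
def compared_price(hourly_price, vcpus, memory_mib, unit_type="units"):
    """Price that price protection would compare, for the given capacity unit type."""
    if unit_type == "vcpu":
        return hourly_price / vcpus
    if unit_type == "memory-mib":
        return hourly_price / memory_mib
    return hourly_price  # per-instance price


# Hypothetical types: (hourly price, vCPUs, memory in MiB).
small = (0.10, 2, 4096)
large = (0.32, 8, 16384)

for unit_type in ("units", "vcpu"):
    print(unit_type, compared_price(*small, unit_type), compared_price(*large, unit_type))
```

Per instance, the larger type costs more than three times as much as the smaller one, but per vCPU it is the cheaper of the two, so it is less likely to be excluded when the target capacity is expressed in vCPUs.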
Considerations
-
You can specify either instance types or instance attributes in a Spot Fleet, but not both at the same time.
When using the CLI, the launch template overrides will override the launch template. For example, if the launch template contains an instance type and the launch template override contains instance attributes, the instances that are identified by the instance attributes will override the instance type in the launch template.
-
When using the CLI, when you specify instance attributes as overrides, you can't also specify weights or priorities.
-
You can specify a maximum of four InstanceRequirements structures in a request configuration.
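A pre-flight check for the last two considerations might look like the following sketch. The function name is hypothetical, and it assumes the launch template configs are already loaded as Python dictionaries shaped like the CLI JSON (with WeightedCapacity and Priority as the override fields for weights and priorities).

```python
MAX_INSTANCE_REQUIREMENTS = 4


def check_considerations(launch_template_configs):
    """Return a list of problems found in the given LaunchTemplateConfigs list."""
    problems = []
    count = 0
    for config in launch_template_configs:
        for override in config.get("Overrides", []):
            if "InstanceRequirements" not in override:
                continue
            count += 1
            # Weights and priorities cannot accompany instance attributes.
            if "WeightedCapacity" in override or "Priority" in override:
                problems.append("override combines InstanceRequirements with a weight or priority")
    if count > MAX_INSTANCE_REQUIREMENTS:
        problems.append(f"{count} InstanceRequirements structures; the maximum is {MAX_INSTANCE_REQUIREMENTS}")
    return problems
```

An empty list means that neither consideration is violated; it does not mean that the request is otherwise valid.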
Create a Spot Fleet with attribute-based instance type selection
You can configure a fleet to use attribute-based instance type selection by using the Amazon EC2 console or the Amazon CLI.
Create a Spot Fleet using the console
To configure a Spot Fleet for attribute-based instance type selection (console)
-
Open the Amazon EC2 console at https://console.amazonaws.cn/ec2/.
-
In the navigation pane, choose Spot Requests, and then choose Request Spot Instances.
-
Follow the steps to create a Spot Fleet. For more information, see Create a Spot Fleet request using defined parameters (console).
While creating the Spot Fleet, configure the fleet for attribute-based instance type selection as follows:
-
For Instance type requirements, choose Specify instance attributes that match your compute requirements.
-
For vCPUs, enter the desired minimum and maximum number of vCPUs. To specify no limit, select No minimum, No maximum, or both.
-
For Memory (GiB), enter the desired minimum and maximum amount of memory. To specify no limit, select No minimum, No maximum, or both.
-
(Optional) For Additional instance attributes, specify one or more attributes to express your compute requirements in more detail. Each additional attribute adds a further constraint to your request.
-
(Optional) Expand Preview matching instance types to view the instance types that have your specified attributes.
Create a Spot Fleet using the Amazon CLI
To configure a Spot Fleet for attribute-based instance type selection (Amazon CLI)
Use the request-spot-fleet (Amazon CLI) command to create a Spot Fleet. Specify the fleet configuration in a JSON file.
aws ec2 request-spot-fleet \
    --region us-east-1 \
    --spot-fleet-request-config file://file_name.json
Example file file_name.json
The following example contains the parameters that configure a Spot Fleet to use attribute-based instance type selection, and is followed by a text explanation.
{
    "AllocationStrategy": "priceCapacityOptimized",
    "TargetCapacity": 20,
    "Type": "request",
    "LaunchTemplateConfigs": [{
        "LaunchTemplateSpecification": {
            "LaunchTemplateName": "my-launch-template",
            "Version": "1"
        },
        "Overrides": [{
            "InstanceRequirements": {
                "VCpuCount": {
                    "Min": 2
                },
                "MemoryMiB": {
                    "Min": 4
                }
            }
        }]
    }]
}
The attributes for attribute-based instance type selection are specified in the
InstanceRequirements structure. In this example, two attributes are
specified:
-
VCpuCount – A minimum of 2 vCPUs is specified. Because no maximum is specified, there is no maximum limit.
-
MemoryMiB – A minimum of 4 MiB of memory is specified. Because no maximum is specified, there is no maximum limit.
Any instance types that have 2 or more vCPUs and 4 MiB or more of memory will be identified. However, price protection and the allocation strategy might exclude some instance types when Spot Fleet provisions the fleet.
For a list and descriptions of all the possible attributes that you can specify, see InstanceRequirements in the Amazon EC2 API Reference.
Note
When InstanceRequirements is included in the fleet configuration,
InstanceType and WeightedCapacity must be
excluded; they cannot determine the fleet configuration at the same time as
instance attributes.
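One way to respect the note above is to generate the configuration file from a script that never emits InstanceType or WeightedCapacity. The following is a minimal sketch that reproduces the example file; the `build_fleet_config` helper is hypothetical, and the values are the ones used in the example.

```python
import json


def build_fleet_config(template_name, version, min_vcpus, min_memory_mib, target_capacity):
    """Build a fleet configuration whose only override is InstanceRequirements."""
    return {
        "AllocationStrategy": "priceCapacityOptimized",
        "TargetCapacity": target_capacity,
        "Type": "request",
        "LaunchTemplateConfigs": [{
            "LaunchTemplateSpecification": {
                "LaunchTemplateName": template_name,
                "Version": version,
            },
            # No InstanceType or WeightedCapacity: attributes alone select the types.
            "Overrides": [{
                "InstanceRequirements": {
                    "VCpuCount": {"Min": min_vcpus},
                    "MemoryMiB": {"Min": min_memory_mib},
                },
            }],
        }],
    }


config = build_fleet_config("my-launch-template", "1", 2, 4, 20)
print(json.dumps(config, indent=4))
```

The printed JSON can be saved as file_name.json and passed to the request-spot-fleet command shown earlier.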
The JSON also contains the following fleet configuration:
-
"AllocationStrategy": "priceCapacityOptimized" – The allocation strategy for the Spot Instances in the fleet.
-
"LaunchTemplateName": "my-launch-template", "Version": "1" – The launch template contains some instance configuration information, but if any instance types are specified, they will be overridden by the attributes that are specified in InstanceRequirements.
-
"TargetCapacity": 20 – The target capacity is 20 instances.
-
"Type": "request" – The request type for the fleet is request.
Examples of configurations that are valid and not valid
If you use the Amazon CLI to create a Spot Fleet, you must make sure that your fleet configuration is valid. The following examples show configurations that are valid and not valid.
Configurations are considered not valid when they contain the following:
-
A single Overrides structure with both InstanceRequirements and InstanceType
-
Two Overrides structures, one with InstanceRequirements and the other with InstanceType
-
Two InstanceRequirements structures with overlapping attribute values within the same LaunchTemplateSpecification
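The three rules above can be approximated with a small local check before submitting a request. This is a simplified sketch, not the validation that Amazon EC2 performs: the function names are hypothetical, the input is assumed to be a dictionary with a top-level SpotFleetRequestConfig key, and overlap is tested on VCpuCount ranges only.

```python
def _vcpu_range(requirements):
    """Return (min, max) vCPUs; a missing Max means no upper limit."""
    bounds = requirements.get("VCpuCount", {})
    return bounds.get("Min", 0), bounds.get("Max", float("inf"))


def _overlap(a, b):
    return a[0] <= b[1] and b[0] <= a[1]


def find_config_errors(request):
    configs = request["SpotFleetRequestConfig"]["LaunchTemplateConfigs"]
    errors = []
    any_attrs = any_types = False
    for config in configs:
        overrides = config.get("Overrides", [])
        attrs = [o["InstanceRequirements"] for o in overrides if "InstanceRequirements" in o]
        has_types = any("InstanceType" in o for o in overrides)
        if attrs and has_types:
            errors.append("one Overrides structure mixes InstanceRequirements and InstanceType")
        elif (attrs and any_types) or (has_types and any_attrs):
            errors.append("separate Overrides structures mix InstanceRequirements and InstanceType")
        any_attrs = any_attrs or bool(attrs)
        any_types = any_types or has_types
        # Simplification: only VCpuCount ranges are compared for overlap.
        ranges = [_vcpu_range(r) for r in attrs]
        if any(_overlap(a, b) for i, a in enumerate(ranges) for b in ranges[i + 1:]):
            errors.append("overlapping VCpuCount ranges within one launch template")
    return errors
```

An empty result only means that none of these three patterns was detected; the service can still reject a request for other reasons.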
Example configurations
- Valid configuration: Single launch template with overrides
- Valid configuration: Single launch template with multiple InstanceRequirements
- Valid configuration: Two launch templates, each with overrides
- Valid configuration: Only InstanceRequirements specified, no overlapping attribute values
- Configuration not valid: Overrides contain InstanceRequirements and InstanceType
- Configuration not valid: Two Overrides contain InstanceRequirements and InstanceType
- Configuration not valid: Overlapping attribute values
Valid configuration: Single launch template with overrides
The following configuration is valid. It contains one launch template and one
Overrides structure containing one
InstanceRequirements structure. A text explanation of the example
configuration follows.
{
    "SpotFleetRequestConfig": {
        "AllocationStrategy": "lowestPrice",
        "ExcessCapacityTerminationPolicy": "default",
        "IamFleetRole": "arn:aws:iam::000000000000:role/aws-ec2-spot-fleet-tagging-role",
        "LaunchTemplateConfigs": [
            {
                "LaunchTemplateSpecification": {
                    "LaunchTemplateName": "My-launch-template",
                    "Version": "1"
                },
                "Overrides": [
                    {
                        "InstanceRequirements": {
                            "VCpuCount": {
                                "Min": 2,
                                "Max": 8
                            },
                            "MemoryMiB": {
                                "Min": 0,
                                "Max": 10240
                            },
                            "MemoryGiBPerVCpu": {
                                "Max": 10000
                            },
                            "RequireHibernateSupport": true
                        }
                    }
                ]
            }
        ],
        "TargetCapacity": 5000,
        "OnDemandTargetCapacity": 0,
        "TargetCapacityUnitType": "vcpu"
    }
}
InstanceRequirements
To use attribute-based instance selection, you must include the
InstanceRequirements structure in your fleet configuration, and
specify the desired attributes for the instances in the fleet.
In the preceding example, the following instance attributes are specified:
-
VCpuCount – The instance types must have a minimum of 2 and a maximum of 8 vCPUs.
-
MemoryMiB – The instance types must have a maximum of 10240 MiB of memory. A minimum of 0 indicates no minimum limit.
-
MemoryGiBPerVCpu – The instance types must have a maximum of 10,000 GiB of memory per vCPU. The Min parameter is optional. By omitting it, you indicate no minimum limit.
TargetCapacityUnitType
The TargetCapacityUnitType parameter specifies the unit for the target
capacity. In the example, the target capacity is 5000 and the
target capacity unit type is vcpu, which together specify a desired
target capacity of 5,000 vCPUs. Spot Fleet will launch enough instances so that
the total number of vCPUs in the fleet is 5,000 vCPUs.
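As a back-of-the-envelope illustration, the number of instances needed to reach a vCPU target depends on the size of the instance types that are launched. The helper below is hypothetical and assumes that every instance in the fleet has the same number of vCPUs, which a real fleet with mixed instance types would not.

```python
import math


def instances_needed(target_capacity, units_per_instance):
    """Smallest instance count whose combined units reach the target capacity."""
    return math.ceil(target_capacity / units_per_instance)


# Target of 5,000 vCPUs, using the largest and smallest sizes allowed by the example.
print(instances_needed(5000, 8))
print(instances_needed(5000, 2))
```

For the example's VCpuCount range of 2 to 8, this puts the fleet size somewhere between 625 and 2,500 instances.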
Valid configuration: Single launch template with multiple InstanceRequirements
The following configuration is valid. It contains one launch template and one
Overrides structure containing two
InstanceRequirements structures. The attributes specified in
InstanceRequirements are valid because the values do not
overlap—the first InstanceRequirements structure specifies a
VCpuCount of 0-2 vCPUs, while the second
InstanceRequirements structure specifies 4-8 vCPUs.
{
    "SpotFleetRequestConfig": {
        "AllocationStrategy": "lowestPrice",
        "ExcessCapacityTerminationPolicy": "default",
        "IamFleetRole": "arn:aws:iam::000000000000:role/aws-ec2-spot-fleet-tagging-role",
        "LaunchTemplateConfigs": [
            {
                "LaunchTemplateSpecification": {
                    "LaunchTemplateName": "MyLaunchTemplate",
                    "Version": "1"
                },
                "Overrides": [
                    {
                        "InstanceRequirements": {
                            "VCpuCount": {
                                "Min": 0,
                                "Max": 2
                            },
                            "MemoryMiB": {
                                "Min": 0
                            }
                        }
                    },
                    {
                        "InstanceRequirements": {
                            "VCpuCount": {
                                "Min": 4,
                                "Max": 8
                            },
                            "MemoryMiB": {
                                "Min": 0
                            }
                        }
                    }
                ]
            }
        ],
        "TargetCapacity": 1,
        "OnDemandTargetCapacity": 0,
        "Type": "maintain"
    }
}
Valid configuration: Two launch templates, each with overrides
The following configuration is valid. It contains two launch templates, each with one
Overrides structure containing one
InstanceRequirements structure. This configuration is useful for
arm and x86 architecture support in the same
fleet.
{
    "SpotFleetRequestConfig": {
        "AllocationStrategy": "lowestPrice",
        "ExcessCapacityTerminationPolicy": "default",
        "IamFleetRole": "arn:aws:iam::000000000000:role/aws-ec2-spot-fleet-tagging-role",
        "LaunchTemplateConfigs": [
            {
                "LaunchTemplateSpecification": {
                    "LaunchTemplateName": "armLaunchTemplate",
                    "Version": "1"
                },
                "Overrides": [
                    {
                        "InstanceRequirements": {
                            "VCpuCount": {
                                "Min": 0,
                                "Max": 2
                            },
                            "MemoryMiB": {
                                "Min": 0
                            }
                        }
                    }
                ]
            },
            {
                "LaunchTemplateSpecification": {
                    "LaunchTemplateName": "x86LaunchTemplate",
                    "Version": "1"
                },
                "Overrides": [
                    {
                        "InstanceRequirements": {
                            "VCpuCount": {
                                "Min": 0,
                                "Max": 2
                            },
                            "MemoryMiB": {
                                "Min": 0
                            }
                        }
                    }
                ]
            }
        ],
        "TargetCapacity": 1,
        "OnDemandTargetCapacity": 0,
        "Type": "maintain"
    }
}
Valid configuration: Only InstanceRequirements specified, no overlapping attribute values
The following configuration is valid. It contains two
LaunchTemplateSpecification structures, each with a launch template
and an Overrides structure containing an
InstanceRequirements structure. The attributes specified in
InstanceRequirements are valid because the values do not
overlap—the first InstanceRequirements structure specifies a
VCpuCount of 0-2 vCPUs, while the second
InstanceRequirements structure specifies 4-8 vCPUs.
{
    "SpotFleetRequestConfig": {
        "AllocationStrategy": "lowestPrice",
        "ExcessCapacityTerminationPolicy": "default",
        "IamFleetRole": "arn:aws:iam::000000000000:role/aws-ec2-spot-fleet-tagging-role",
        "LaunchTemplateConfigs": [
            {
                "LaunchTemplateSpecification": {
                    "LaunchTemplateName": "MyLaunchTemplate",
                    "Version": "1"
                },
                "Overrides": [
                    {
                        "InstanceRequirements": {
                            "VCpuCount": {
                                "Min": 0,
                                "Max": 2
                            },
                            "MemoryMiB": {
                                "Min": 0
                            }
                        }
                    }
                ]
            },
            {
                "LaunchTemplateSpecification": {
                    "LaunchTemplateName": "MyOtherLaunchTemplate",
                    "Version": "1"
                },
                "Overrides": [
                    {
                        "InstanceRequirements": {
                            "VCpuCount": {
                                "Min": 4,
                                "Max": 8
                            },
                            "MemoryMiB": {
                                "Min": 0
                            }
                        }
                    }
                ]
            }
        ],
        "TargetCapacity": 1,
        "OnDemandTargetCapacity": 0,
        "Type": "maintain"
    }
}
Configuration not valid: Overrides contain InstanceRequirements and InstanceType
The following configuration is not valid. The Overrides structure
contains both InstanceRequirements and InstanceType. For
the Overrides, you can specify either InstanceRequirements
or InstanceType, but not both.
{
    "SpotFleetRequestConfig": {
        "AllocationStrategy": "lowestPrice",
        "ExcessCapacityTerminationPolicy": "default",
        "IamFleetRole": "arn:aws:iam::000000000000:role/aws-ec2-spot-fleet-tagging-role",
        "LaunchTemplateConfigs": [
            {
                "LaunchTemplateSpecification": {
                    "LaunchTemplateName": "MyLaunchTemplate",
                    "Version": "1"
                },
                "Overrides": [
                    {
                        "InstanceRequirements": {
                            "VCpuCount": {
                                "Min": 0,
                                "Max": 2
                            },
                            "MemoryMiB": {
                                "Min": 0
                            }
                        }
                    },
                    {
                        "InstanceType": "m5.large"
                    }
                ]
            }
        ],
        "TargetCapacity": 1,
        "OnDemandTargetCapacity": 0,
        "Type": "maintain"
    }
}
Configuration not valid: Two Overrides contain InstanceRequirements and InstanceType
The following configuration is not valid. The Overrides structures
contain both InstanceRequirements and InstanceType. You
can specify either InstanceRequirements or InstanceType,
but not both, even if they're in different Overrides structures.
{
    "SpotFleetRequestConfig": {
        "AllocationStrategy": "lowestPrice",
        "ExcessCapacityTerminationPolicy": "default",
        "IamFleetRole": "arn:aws:iam::000000000000:role/aws-ec2-spot-fleet-tagging-role",
        "LaunchTemplateConfigs": [
            {
                "LaunchTemplateSpecification": {
                    "LaunchTemplateName": "MyLaunchTemplate",
                    "Version": "1"
                },
                "Overrides": [
                    {
                        "InstanceRequirements": {
                            "VCpuCount": {
                                "Min": 0,
                                "Max": 2
                            },
                            "MemoryMiB": {
                                "Min": 0
                            }
                        }
                    }
                ]
            },
            {
                "LaunchTemplateSpecification": {
                    "LaunchTemplateName": "MyOtherLaunchTemplate",
                    "Version": "1"
                },
                "Overrides": [
                    {
                        "InstanceType": "m5.large"
                    }
                ]
            }
        ],
        "TargetCapacity": 1,
        "OnDemandTargetCapacity": 0,
        "Type": "maintain"
    }
}
Configuration not valid: Overlapping attribute values
The following configuration is not valid. The two
InstanceRequirements structures each contain "VCpuCount":
{"Min": 0, "Max": 2}. The values for these attributes overlap, which will
result in duplicate capacity pools.
{
    "SpotFleetRequestConfig": {
        "AllocationStrategy": "lowestPrice",
        "ExcessCapacityTerminationPolicy": "default",
        "IamFleetRole": "arn:aws:iam::000000000000:role/aws-ec2-spot-fleet-tagging-role",
        "LaunchTemplateConfigs": [
            {
                "LaunchTemplateSpecification": {
                    "LaunchTemplateName": "MyLaunchTemplate",
                    "Version": "1"
                },
                "Overrides": [
                    {
                        "InstanceRequirements": {
                            "VCpuCount": {
                                "Min": 0,
                                "Max": 2
                            },
                            "MemoryMiB": {
                                "Min": 0
                            }
                        }
                    },
                    {
                        "InstanceRequirements": {
                            "VCpuCount": {
                                "Min": 0,
                                "Max": 2
                            },
                            "MemoryMiB": {
                                "Min": 0
                            }
                        }
                    }
                ]
            }
        ],
        "TargetCapacity": 1,
        "OnDemandTargetCapacity": 0,
        "Type": "maintain"
    }
}
Preview instance types with specified attributes
You can use the get-instance-types-from-instance-requirements Amazon CLI command to preview the instance types that match the attributes that you specify. This is especially useful for working out what attributes to specify in your request configuration without launching any instances. Note that the command does not consider available capacity.
To preview a list of instance types by specifying attributes using the Amazon CLI
-
(Optional) To generate all of the possible attributes that can be specified, use the get-instance-types-from-instance-requirements command and the --generate-cli-skeleton parameter. You can optionally direct the output to a file to save it by using input > attributes.json.
aws ec2 get-instance-types-from-instance-requirements \
    --region us-east-1 \
    --generate-cli-skeleton input > attributes.json
Expected output
{
    "DryRun": true,
    "ArchitectureTypes": [ "i386" ],
    "VirtualizationTypes": [ "hvm" ],
    "InstanceRequirements": {
        "VCpuCount": { "Min": 0, "Max": 0 },
        "MemoryMiB": { "Min": 0, "Max": 0 },
        "CpuManufacturers": [ "intel" ],
        "MemoryGiBPerVCpu": { "Min": 0.0, "Max": 0.0 },
        "ExcludedInstanceTypes": [ "" ],
        "InstanceGenerations": [ "current" ],
        "SpotMaxPricePercentageOverLowestPrice": 0,
        "OnDemandMaxPricePercentageOverLowestPrice": 0,
        "BareMetal": "included",
        "BurstablePerformance": "included",
        "RequireHibernateSupport": true,
        "NetworkInterfaceCount": { "Min": 0, "Max": 0 },
        "LocalStorage": "included",
        "LocalStorageTypes": [ "hdd" ],
        "TotalLocalStorageGB": { "Min": 0.0, "Max": 0.0 },
        "BaselineEbsBandwidthMbps": { "Min": 0, "Max": 0 },
        "AcceleratorTypes": [ "gpu" ],
        "AcceleratorCount": { "Min": 0, "Max": 0 },
        "AcceleratorManufacturers": [ "nvidia" ],
        "AcceleratorNames": [ "a100" ],
        "AcceleratorTotalMemoryMiB": { "Min": 0, "Max": 0 },
        "NetworkBandwidthGbps": { "Min": 0.0, "Max": 0.0 },
        "AllowedInstanceTypes": [ "" ]
    },
    "MaxResults": 0,
    "NextToken": ""
}
-
Create a JSON configuration file using the output from the previous step, and configure it as follows:
Note
You must provide values for ArchitectureTypes, VirtualizationTypes, VCpuCount, and MemoryMiB. You can omit the other attributes; when omitted, the default values are used.
For a description of each attribute and their default values, see get-instance-types-from-instance-requirements in the Amazon EC2 Command Line Reference.
-
For ArchitectureTypes, specify one or more types of processor architecture.
-
For VirtualizationTypes, specify one or more types of virtualization.
-
For VCpuCount, specify the minimum and maximum number of vCPUs. To specify no minimum limit, for Min, specify 0. To specify no maximum limit, omit the Max parameter.
-
For MemoryMiB, specify the minimum and maximum amount of memory in MiB. To specify no minimum limit, for Min, specify 0. To specify no maximum limit, omit the Max parameter.
-
You can optionally specify one or more of the other attributes to further constrain the list of instance types that are returned.
-
To preview the instance types that have the attributes that you specified in the JSON file, use the get-instance-types-from-instance-requirements command, and specify the name and path to your JSON file by using the --cli-input-json parameter. You can optionally format the output to appear in a table format.
aws ec2 get-instance-types-from-instance-requirements \
    --cli-input-json file://attributes.json \
    --output table
Example attributes.json file
In this example, the required attributes are included in the JSON file. They are ArchitectureTypes, VirtualizationTypes, VCpuCount, and MemoryMiB. In addition, the optional InstanceGenerations attribute is also included. Note that for MemoryMiB, the Max value can be omitted to indicate that there is no limit.
{
    "ArchitectureTypes": [
        "x86_64"
    ],
    "VirtualizationTypes": [
        "hvm"
    ],
    "InstanceRequirements": {
        "VCpuCount": {
            "Min": 4,
            "Max": 6
        },
        "MemoryMiB": {
            "Min": 2048
        },
        "InstanceGenerations": [
            "current"
        ]
    }
}
Example output
------------------------------------------
|GetInstanceTypesFromInstanceRequirements|
+----------------------------------------+
||             InstanceTypes            ||
|+--------------------------------------+|
||             InstanceType             ||
|+--------------------------------------+|
||  c4.xlarge                           ||
||  c5.xlarge                           ||
||  c5a.xlarge                          ||
||  c5ad.xlarge                         ||
||  c5d.xlarge                          ||
||  c5n.xlarge                          ||
||  c6a.xlarge                          ||
...
-
After identifying instance types that meet your needs, make note of the instance attributes that you used so that you can use them when configuring your fleet request.
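Before running the preview command, it can help to confirm that the JSON file contains the four required attributes named in the procedure above. The following is a minimal sketch of such a check; the `missing_required` helper is hypothetical, and the dictionary mirrors the example attributes.json file.

```python
import json

REQUIRED_TOP_LEVEL = ("ArchitectureTypes", "VirtualizationTypes")
REQUIRED_IN_REQUIREMENTS = ("VCpuCount", "MemoryMiB")


def missing_required(attributes):
    """Return the names of required attributes that are absent or empty."""
    missing = [key for key in REQUIRED_TOP_LEVEL if not attributes.get(key)]
    requirements = attributes.get("InstanceRequirements", {})
    missing += [key for key in REQUIRED_IN_REQUIREMENTS if key not in requirements]
    return missing


attributes = {
    "ArchitectureTypes": ["x86_64"],
    "VirtualizationTypes": ["hvm"],
    "InstanceRequirements": {
        "VCpuCount": {"Min": 4, "Max": 6},
        "MemoryMiB": {"Min": 2048},        # no Max: no upper limit
        "InstanceGenerations": ["current"],
    },
}

print(missing_required(attributes))
print(json.dumps(attributes, indent=4))
```

If the first line of output is an empty list, the printed JSON can be saved as attributes.json and passed to the command with --cli-input-json.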