Releases: aws-samples/aws-eda-slurm-cluster

aws-eda-slurm-cluster v2.5.0

09 Sep 18:52
8dff7cd

What's Changed

  • Add support for ParallelCluster versions 3.9.0 and 3.9.1 by @cartalla in #232

New Features

  • Feature #229: Add support for ParallelCluster version 3.9.0 and 3.9.1

Bug Fixes

  • Bug #204: Can only configure 3 clusters on a submitter host
  • Bug #230: Python 3.8 Lambda deprecated on 10/12/2024
    Update Lambdas to use a newer version of Python
  • Bug #231: Cluster fails to deploy because create_slurm_accounts.py fails

Full Changelog: v2.4.0...v2.5.0

aws-eda-slurm-cluster v2.4.0

09 Sep 17:56
ded618c

What's Changed

Add the following config options:

  • slurm/ParallelClusterConfig/ClusterConfig
  • slurm/SlurmCtl/AdditionalSecurityGroups
  • slurm/SlurmCtl/AdditionalIamPolicies
  • slurm/SlurmCtl/Imds/Secured
  • slurm/InstanceConfig/AdditionalSecurityGroups
  • slurm/InstanceConfig/AdditionalIamPolicies
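
The option names above are slash-separated paths into the nested config file. As a rough illustration only, the Python sketch below expands those paths into a nested mapping so the hierarchy is easier to see. The `expand_option_paths` helper and the `None` placeholder values are hypothetical and not part of the project; the real value types are described in the project's config documentation.

```python
import json

# Option paths copied from the list above.
OPTION_PATHS = [
    "slurm/ParallelClusterConfig/ClusterConfig",
    "slurm/SlurmCtl/AdditionalSecurityGroups",
    "slurm/SlurmCtl/AdditionalIamPolicies",
    "slurm/SlurmCtl/Imds/Secured",
    "slurm/InstanceConfig/AdditionalSecurityGroups",
    "slurm/InstanceConfig/AdditionalIamPolicies",
]


def expand_option_paths(paths, placeholder=None):
    """Expand slash-separated option paths into a nested dict.

    Hypothetical helper for illustration; each leaf gets `placeholder`
    because the real values depend on the deployment.
    """
    config = {}
    for path in paths:
        *parents, leaf = path.split("/")
        node = config
        for key in parents:
            # Create intermediate levels on demand.
            node = node.setdefault(key, {})
        node[leaf] = placeholder
    return config


if __name__ == "__main__":
    # Print the nesting that the option paths imply.
    print(json.dumps(expand_option_paths(OPTION_PATHS), indent=2))
```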

Added documentation for all config parameters.

Changed the StackName default from slurm-top to slurm-config.

Fix the slurm/ParallelClusterConfig/Dcv/Enabled option.
Change the option name from Enable to Enabled to match ParallelCluster.

Fix the setting of the ParallelCluster HeadNode/Dcv/AllowedIps config.
It was being set from the non-existent slurm/ParallelClusterConfig/AllowedIps instead of slurm/ParallelClusterConfig/HeadNode/Dcv/AllowedIps.

Delete the following config option because it uses the legacy cluster.

  • slurm/EdaSlurmClusterStackName

New Features

  • Feature #225: Add custom IAM policies and security groups for head and compute
    Add config options for extra security groups and iam policies for hea… by @cartalla in #228

Full Changelog: v2.3.4...v2.4.0

aws-eda-slurm-cluster v2.3.4

09 Sep 17:25
396fa78

What's Changed

New Features

  • Feature #219: Update documentation for custom AMIs

Bug Fixes

  • Bug #212: PyYAML 5.4.1 in source/requirements.txt does not install due to release of cython3.0
    Relax PyYAML version requirement by @cartalla in #215
  • Bug #216: Delete local build files that can contain tokens or stale values
    Remove creation of local AMI build-files by @cartalla in #217
  • Bug #223: module load sets environment variables that override values in the sbatch submission script
    Remove sbatch and srun defaults from modulefile by @cartalla in #224

Full Changelog: v2.3.3...v2.3.4

aws-eda-slurm-cluster v2.3.3

09 Sep 17:15
58f70e7

What's Changed

  • Update config files and fix errors found in testing new configs.
  • Clean up ansible-lint errors and warnings.
  • Paginate describe_instances when creating the head node A record.
  • Add default MungeKeySecret.
  • Increase the timeout for the SSM command that configures submitters so slurm has time to compile.
  • Force slurm to be rebuilt for submitters of all OS distributions, even if they match the OS of the cluster.
  • Paginate describe_instances in UpdateHeadNode lambda
  • Add check for min memory of 4 GB for slurm controller
  • Update documentation
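
Two of the items above concern paginating describe_instances. A single EC2 DescribeInstances call may return only part of the results, so code that reads just the first response can miss instances. The sketch below shows the general pattern using a boto3 paginator. It is not the project's code: the `iter_instances` helper and the tag filter in the usage comment are hypothetical.

```python
def iter_instances(ec2_client, filters=None):
    """Yield every instance dict across all pages of describe_instances.

    `ec2_client` is expected to be a boto3 EC2 client, for example
    boto3.client("ec2"). Hypothetical helper for illustration.
    """
    paginator = ec2_client.get_paginator("describe_instances")
    kwargs = {}
    if filters:
        # Only pass Filters when the caller supplied some.
        kwargs["Filters"] = filters
    for page in paginator.paginate(**kwargs):
        for reservation in page.get("Reservations", []):
            yield from reservation.get("Instances", [])


# Example usage (requires boto3 and AWS credentials; the tag value is made up):
#
#   import boto3
#   ec2 = boto3.client("ec2")
#   name_filter = [{"Name": "tag:Name", "Values": ["HeadNode"]}]
#   for instance in iter_instances(ec2, name_filter):
#       print(instance["InstanceId"])
```

Passing the client in as an argument keeps the helper easy to exercise with a stub client in tests.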

New Features

  • Feature #207: Add --RESEnvironmentName to the installer to ease integration with Research and Engineering Studio (RES).

Bug Fixes

  • Bug #203: slurm_zfs.yml doesn't work
    slurm zfsyml doesnt work by @cartalla in #214
  • Bug #206: Default head node instance type for arm cluster is incorrect
    Set default head node instance type based on cluster architecture.

Full Changelog: v2.3.2...v2.3.3

aws-eda-slurm-cluster v2.3.2

09 Sep 16:57
a8b6555

What's Changed

Bug Fixes

  • Bug #200: Getting EC2 instance info fails
    Ignore pricing lists for capacity blocks by @cartalla in #201
  • Bug #202: Changing controller instance type doesn't cause cluster to be updated
    Update cluster when config file changes by @cartalla in #205

Full Changelog: v2.3.1...v2.3.2

aws-eda-slurm-cluster v2.3.1

30 Jan 23:52
eba2c5f

New Features

No new features in this release.

Bug Fixes

  • Bug #189 - [BUG] Deployment fails if RESEnvironmentName not configured
  • Bug #190 - [BUG] Handle case where ParallelCluster database stack isn't ready yet
  • Bug #191 - [BUG] Deployment fails if submitter security groups not configured
  • #193 - [DOCS] Baseline setup instructions from a clean AMI
  • Bug #196 - [BUG] install.sh fails if CDK bootstrap stack doesn't exist
  • Bug #197 - [BUG] Allow creation of stack in default VPC with only public subnets

aws-eda-slurm-cluster v2.3.0

10 Jan 17:18
d569cfb

New Features

No new features in this release, but the change to the way the config file is passed to the Lambda that creates the cluster is significant enough
to justify a new minor version.

Bug Fixes

  • Bug #179 - Stack update fails because json payload too large
  • Bug #184 - Build files contain CFN tokens until stack is updated
  • Bug #184 - Rocky 8 AMI builds fail
  • Bug #186 - ParallelCluster deployment fails if AMI doesn't support instance type

aws-eda-slurm-cluster v2.2.0

21 Dec 20:54
12bb452

New Features

  • Feature #183 - Add support for ParallelCluster 3.8.0
    Adds support for custom munge key
    Adds support for Rocky Linux 8

aws-eda-slurm-cluster v2.1.0

16 Dec 03:20
3717730

New Features

  • Feature #177 - Build slurm for distribution of the submitter
  • Feature #181 - Automate configuration of RES cluster manager and submitters

This release includes significant updates that automate RES configuration, as well as deconfiguration when the cluster is deleted.

Bug Fixes

  • Bug #160 - Reduce IAM permissions for CreateParallelCluster lambda
  • Bug #178 - Playbook updates don't get updated correctly
  • Bug #180 - Cluster not updated when scripts saved in s3 are updated

aws-eda-slurm-cluster v2.0.3

28 Nov 19:45
13bc31e

New Features

  • Feature #170 - Integrate with RES

Bug Fixes

  • Bug #167 - Modulefile not created
  • Bug #172 - When deconfiguring submitter, fstab entry isn't removed