Tutorial: Accessing GEOS-Chem Input Data using AWS CLI
This tutorial will guide you through the process of accessing and using the GEOS-Chem Input Data with AWS CLI. Alternatively, you can access the data via AWS S3 Explorer.
The workflow is
-
This step only has to be done once.
Run a GEOS-Chem Classic, GCHP, or HEMCO standalone simulation.
Install and configure AWS CLI
If you have already installed and configured the AWS CLI previously, continue to Data access.
Step 1: Install AWS CLI
Follow the instructions to install the AWS CLI from the AWS CLI User Guide.
Step 2: Configure AWS CLI
Run the following command to configure AWS CLI with your credentials:
$ aws configure
For instructions on aws configure, refer to the Configure the AWS CLI .
Access and download data
Step 1: List available data
To view the available data in the GEOS-Chem Input Data S3 bucket, use the following command:
$ aws s3 ls s3://geos-chem/
or without AWS account required
$ aws s3 ls --no-sign-request s3://geos-chem/
Step 3: Download the data
Tip
If you are using GEOS-Chem Classic or the HEMCO standalone model, you can download data with a dry-run simulation, while still using the AWS CLI data transfer protocol.
Once you have located the data you need, you can download it to your local cluster or an EC2 instance. For example,
$ aws s3 cp s3://geos-chem/GEOS_0.5x0.625/MERRA2/2024/05 ./ --recursive
This command will copy the data to your current path.
Run simulations using downloaded data
Once you have downloaded the data from the GEOS-Chem Input Data portal to your computer system or EC2 instance, you may run a GEOS-Chem Classic, GCHP, or HEMCO standalone simulation. Please refer to the relevant user guide listed below.
Running GCHP on AWS
If you wish to use the computing resources on AWS to run GCHP and are seeking for an AMI, feel free to check our Set up AWS ParallelCluster guide.