AWS Elasticsearch

cloud.gov offers aws-elasticsearch as a service hosted in AWS Elasticsearch and Opensearch.

Plans

Plan Name	Instance Type	Description	Number of nodes
`es-dev`	t3.small.search	Single data node for non-prod use only	1
`es-medium`	c5.large.search	3 Primary and 2 Data node cluster	5
`es-medium-ha`	c5.large.search	3 Primary and 4 Data node cluster	7
`es-large`	c5.xlarge.search	3 Primary and 2 Data node cluster	5
`es-large-ha`	c5.xlarge.search	3 Primary and 4 Data node cluster	7
`es-xlarge`	c5.2xlarge.search	3 Primary and 2 Data node cluster	5
`es-xlarge-ha`	c5.2xlarge.search	3 Primary and 4 Data node cluster	7
`es-2xlarge-gp`	m5.2xlarge.search	3 Primary and 2 Data node cluster	5
`es-2xlarge-gp-ha`	m5.2xlarge.search	3 Primary and 4 Data node cluster	7
`es-4xlarge-gp`	m5.4xlarge.search	3 Primary and 2 Data node cluster	5
`es-4xlarge-gp-ha`	m5.4xlarge.search	3 Primary and 4 Data node cluster	7
`es-8xlarge-gp`	m5.4xlarge.search	3 Primary and 4 Data node cluster (by request)	7
`es-8xlarge-gp-ha`	m5.4xlarge.search	3 Primary and 8 Data node cluster (by request)	11
`es-12xlarge-gp`	m5.12xlarge.search	3 Primary and 2 Data node cluster (by request)	5
`es-12xlarge-gp-ha`	m5.12xlarge.search	3 Primary and 4 Data node cluster (by request)	7
`es-24xlarge-gp`	m5.12xlarge.search	3 Primary and 4 Data node cluster (by request)	7
`es-24xlarge-gp-ha`	m5.12xlarge.search	3 Primary and 8 Data node cluster (by request)	11

Service name is aws-elasticsearch for all plans.

When to use

This service is geared toward applications that need to provide search capability, or interact with indexed data. Although Elasticsearch is often used as part of the ELK logging stack (Elasticsearch + Logstash + Kibana), this service does not include Kibana, and isn’t suitable as a component of a logging solution. To find out more about logging on the cloud.gov platform, please see the section on logs.

Create an instance

To create a service instance run the following command:

cf create-service aws-elasticsearch {service-plan} {service-name}

For example, to create a new instance using the es-medium plan named my-elastic-service , you would enter the following at the cf CLI:

cf create-service aws-elasticsearch es-medium my-elastic-service

Note: AWS Elasticsearch creation times will vary and is outside of Cloud.gov’s control. AWS says approximately 15-30 mins per node.

Setting optional parameters

name	required	description	example
`advanced_options`	false	map for advanced options	see below
`ElasticsearchVersion`	false	Specifies a supported major version in search (must be in “”)	`OpenSearch_2.11`

These are the current supported major versions for Elasticsearch version:

Elasticsearch_7.10
OpenSearch_1.3
OpenSearch_2.11

If you don’t specify any option, currently Elasticsearch_7.10 is used by default.

If you want to specify the Elasticsearch version:

cf create-service aws-elasticsearch \
    ${SERVICE_PLAN_NAME} \
    ${SERVICE_NAME} \
    -c '{"ElasticsearchVersion": "OpenSearch_2.11"}'

Advanced Options

These are advanced tuning options that can have significant performance or behavior effects on your cluster. They are specified as key/value pairs under the advanced_options map in the core parameters. Note: although these all represent numbers, they are all specified as strings. Additionally, although they are dotted, they are not nested keys.

Name	description	default
indices.fielddata.cache.size	percentage of JVM heap allocated to field data	“20”
indices.query.bool.max_clause_count	maximum number of clauses allowed in a Lucene boolean query	“1024”

Here are some examples of how to set these values:

cf create-service aws-elasticsearch es-medium my-es-service-1 \
    -c '{"advanced_options": {"indices.fielddata.cache.size": "21"}}'

cf create-service aws-elasticsearch es-medium my-es-service-2 \
    -c '{"advanced_options": {"indices.query.bool.max_clause_count": "1025"}}'

cf create-service aws-elasticsearch es-medium my-es-service-3 \
    -c '{"advanced_options": {"indices.query.bool.max_clause_count": "1025", "indices.fielddata.cache.size": "21"}}'

Note - if you are using the cf CLI utility on Windows, see the examples section of the Cloud Foundry documentation for specific formatting of parameters.

Shard/replica configuration for high availability

The medium and large plans are mainly geared toward customers that need a single index and have coverage with 2 data nodes. For customers wanting more coverage and more indexes, the medium-ha nd large-ha plans scale the cluster to 4 data nodes to offer high availability (HA).

For additional information on configuring your service for high availability when using anything other than the es-dev plan, you can refer to this article from the Elasticsearch documentation for an overview of how clusters work.

AWS specific information can be found in the AWS Developer Guide. Note - AWS recently changed the name of this service to AWS OpenSearch.

Update an instance

Setting optional parameters

name	required	description	example
`volume_type`	false	Specifies the EBS storage volume type. Valid options: `gp3`	`gp3`

Changing instance plans

Please note that you cannot use the broker to update the instance plan at this time.

Updating storage volume type

To update an existing Elasticsearch/Opensearch instance to use gp3 storage volumes:

cf update-service ${SERVICE_NAME} \
  -c '{"volume_type": "gp3"}'

Updating the volume type to gp3 for your Elasticsearch/Opensearch instance will trigger a blue/green deployment and some amount of downtime while the instance updates.

Connecting to your service instance

If you need to directly access your service instance from your local environment, please read this knowledge base article for more information.

Managing backups

By default AWS will create hourly snapshot backups of your Elasticsearch service instance and retain those snapshots for 14 days - 336 total snapshots are retained. These snapshots are only accessible to cluster restore purposes only and can not be exported.

For customers that would like to import or export their Elasticsearch data, this can be done following the AWS Elasticsearch Manual Snapshot Process

As part of the process of using manual snapshots, customers will need to assign an existing S3 bucket and the broker will provide a AWS policy ARN to use with the manual S3 snapshot process

Create an instance of the S3 service:
```
cf create-service s3 basic my-s3-bucket
```
Create a service key to access S3 credentials:
```
cf create-service-key my-s3-bucket my-key
```

Get your s3 bucket name:

s3_credentials=$(cf service-key my-s3-bucket my-key | tail -n +3)
s3_bucket=$(echo "${s3_credentials}" | jq -r '.credentials.bucket')

Assign your new bucket to your Elasticsearch instance:

cf create-service-key my-elastic-service my-key -c '{"bucket":"${s3_bucket}"}'

Find your AWS ARN for your AWS policy to reference for your backup:

es_arn=$(cf service-key my-elastic-service my-key | tail -n +3)
snapshotRoleARN=$(echo "${es_arn}" | jq -r '.credentials.snapshotRoleARN')

Once you have your s3 bucket connected and have the snapshotRoleARN you can then inside your application connect to the AWS ES host and register your s3 repository endpoint and then perform your snapshot export/import operations using AWS signed HTTP headers. Due to the nature of AWS Signature Calculations there is no direct way with curl to perform these operations and it’s best left to client libraries in a programing language your applications are written in.

cloud.gov does offer a code sample repository on Github - aws-elasticsearch-example that shows an example in Python on how to interact with the new ES service using signed headers. Our customers are encouraged to submit PRs of other examples to share with fellow customers.

Encryption

Every non-dev AWS Elasticsearch instance configured through cloud.gov is encrypted at rest. We use the industry standard AES-256 encryption algorithm to encrypt your data on the server that hosts your AWS Elasticsearch instance.

Service Name	Plan Name	Encryption at Rest
`aws-elasticsearch`	`es-dev`	No
`aws-elasticsearch`	`es-medium`	Yes
`aws-elasticsearch`	`es-medium-ha`	Yes

Note: If you have access to larger service plans, they will mirror same settings as es-medium or es-medium-ha .

Rotating credentials

You can rotate credentials by creating a new instance and deleting the existing instance. If this is not an option, email cloud.gov support to request rotating the credentials manually.

The broker in GitHub

You can find the broker here: https://github.com/cloud-gov/aws-broker.

AWS Elasticsearch

Table of Contents

Plans

When to use

Create an instance

Setting optional parameters

Advanced Options

Shard/replica configuration for high availability

Update an instance

Setting optional parameters

Changing instance plans

Updating storage volume type

Connecting to your service instance

Managing backups

Encryption

Rotating credentials

The broker in GitHub