AWS Elastic Search Service

Amazon Elasticsearch Service is a fully managed service that is easy to deploy, easy to secure, and cost effective at scale.

Features include:

Support for the tools that build, monitor, and troubleshoot your applications at the scale that you need.
Support for open source Elasticsearch APIs, managed Kibana, integration with Logstash and other AWS services, and built-in alerting and SQL querying.
Pay for only for what is used with no upfront costs or usage requirements. For example, you can get the ELK stack that you need, without the operational overhead.

Use the AWS public cloud integration to discover and collect metrics against the AWS service.

External reference

What Is Amazon Elasticsearch Service?

Setup

To set up the AWS integration and discover the AWS service, go to AWS Integration Discovery Profile and select Elastic Search Service.

Event support

CloudTrail event support

Supported
Configurable in OpsRamp AWS Integration Discovery Profile.

CloudWatch alarm support

Not Supported

Supported metrics

OpsRamp Metric	Metric Display Name	Unit	Aggregation Type
aws_es_Nodes Number of nodes in the Amazon ES cluster.	Nodes	Count	Average
aws_es_SearchableDocuments Total number of searchable documents across all indices in the cluster.	SearchableDocuments	Count	Average
aws_es_DeletedDocuments Total number of deleted documents across all indices in the cluster.	DeletedDocuments	Count	Average
aws_es_CPUUtilization Maximum percentage of CPU resources used for data nodes in the cluster.	CPUUtilization.es	Percent	Average
aws_es_FreeStorageSpace Free space, in megabytes, for all data nodes in the cluster.	FreeStorageSpace.es	Megabytes	Minimum
aws_es_ClusterUsedSpace Total used space, in megabytes, for a cluster.	ClusterUsedSpace	Megabytes	Minimum
aws_es_ClusterIndexWritesBlocked Indicates whether the cluster is accepting or blocking incoming write requests.	ClusterIndexWritesBlocked	Count	Maximum
aws_es_JVMMemoryPressure Maximum percentage of the Java heap used for all data nodes in the cluster.	JVMMemoryPressure	Percent	Maximum
aws_es_AutomatedSnapshotFailure Number of failed automated snapshots for the cluster.	AutomatedSnapshotFailure	Count	Maximum
aws_es_CPUCreditBalance Remaining CPU credits available for data nodes in the cluster.	CPUCreditBalance.es	Count	Minimum
aws_es_KibanaHealthyNodes Health check for Kibana.	KibanaHealthyNodes	Count	Minimum
aws_es_MasterCPUUtilization Maximum percentage of CPU resources used by the dedicated master nodes.	MasterCPUUtilization	Percent	Average
aws_es_MasterJVMMemoryPressure Maximum percentage of the Java heap used for all dedicated master nodes in the cluster.	MasterJVMMemoryPressure	Percent	Maximum
aws_es_MasterCPUCreditBalance Remaining CPU credits available for dedicated master nodes in the cluster.	MasterCPUCreditBalance	Count	Minimum
aws_es_MasterReachableFromNode Health check for MasterNotDiscovered exceptions. Value of 1 indicates normal behavior.	MasterReachableFromNode	Count	Minimum
aws_es_ClusterStatus_green_es Indicates that all index shards are allocated to nodes in the cluster.	ClusterStatus.green.es	Count	Maximum
aws_es_ClusterStatus_yellow_es Indicates that the primary shards for all indices are allocated to nodes in a cluster, but the replica shards for at least one index are not.	ClusterStatus.yellow.es	Count	Maximum
aws_es_ClusterStatus_red_es Indicates that the primary and replica shards of at least one index are not allocated to nodes in a cluster.	ClusterStatus.red.es	Count	Maximum
aws_es_2xx Number of requests to the domain that resulted in the given HTTP response code 2xx.	2xx	Count	Sum
aws_es_3xx Number of requests to the domain that resulted in the given HTTP response code 3xx.	3xx	Count	Sum
aws_es_4xx Number of requests to the domain that resulted in the given HTTP response code 4xx.	4xx	Count	Sum
aws_es_5xx Number of requests to the domain that resulted in the given HTTP response code 5xx.	5xx	Count	Sum
aws_es_AlertingDegraded Value of 1 means that the alerting index is red or one or more nodes is not on schedule. Value of 0 indicates normal behavior.	AlertingDegraded	Count	Maximum
aws_es_AlertingIndexExists Value of 1 means the .opendistro-alerting-config index exists. Value of 0 means it does not. Until you use the alerting feature for the first time, this value remains 0.	AlertingIndexExists	Count	Maximum
aws_es_AlertingIndexStatus_green Health of the index. Value of 1 means green. Value of 0 means that the index doesnt exist or isnt green.	AlertingIndexStatus.green	Count	Maximum
aws_es_AlertingIndexStatus_red Health of the index. Value of 1 means red. Value of 0 means that the index does notexist or is not red.	AlertingIndexStatus.red	Count	Maximum
aws_es_AlertingIndexStatus_yellow Health of the index. Value of 1 means yellow. Value of 0 means that the index does not exist or is not yellow.	AlertingIndexStatus.yellow	Count	Maximum
aws_es_AlertingNodesNotOnSchedule Value of 1 means some jobs are not running on schedule. Value of 0 means that all alerting jobs are running on schedule (or that no alerting jobs exist). Check the Amazon ES console or make a _nodes/stats request to see if any nodes show high resource usage.	AlertingNodesNotOnSchedule	Count	Maximum
aws_es_AlertingNodesOnSchedule Value of 1 means that all alerting jobs are running on schedule (or that no alerting jobs exist). Value of 0 means some jobs are not running on schedule.	AlertingNodesOnSchedule	Count	Maximum
aws_es_SQLUnhealthy Value of 1 indicates that, in response to certain requests, the SQL plugin is returning 5xx response codes or passing invalid query DSL to Elasticsearch. Other requests should continue to succeed. Value of 0 indicates no recent failures. If a sustained value of 1 is displayed, troubleshoot the requests that clients are making to the plugin.	SQLUnhealthy	Count	Maximum
aws_es_SQLRequestCount Number of requests to the Open Distro SQL API.	SQLRequestCount	Count	Sum
aws_es_AlertingScheduledJobEnabled Value of 1 means that the opendistro.scheduled_jobs.enabled cluster setting is true. Value of 0 means it is false and scheduled jobs are disabled.	AlertingScheduledJobEnabled	Count	Maximum
aws_es_SQLFailedRequestCountBySysErr Number of requests to the Open Distro SQL API API that failed due to a server problem or feature limitation. For example, a request might return HTTP status code 503 due to a VerificationException.	SQLFailedRequestCountBySysErr	Count	Sum
aws_es_SQLFailedRequestCountByCusErr Number of requests to the Open Distro SQL API that failed due to a client issue. For example, a request might return HTTP status code 400 due to an IndexNotFoundException.	SQLFailedRequestCountByCusErr	Count	Sum