Version: Next

Telemetry

Integrating Metrices through Prometheus-exports can better seamlessly connect to related monitoring platforms such as Prometheus and Grafana, improving the ability to monitor and alarm of the SeaTunnel cluster.

You can configure telemetry's configurations in the seatunnel.yaml file.

The following is an example declarative configuration.

seatunnel:
  engine:
    telemetry:
      metric:
        enabled: true # Whether open metrics export

Metrics

The metric text of prometheus,which get from http://{instanceHost}:5801/hazelcast/rest/instance/metrics.

The metric text of openMetrics,which get from http://{instanceHost}:5801/hazelcast/rest/instance/openmetrics.

Available metrics include the following categories.

Note: All metrics both have the same labelName cluster, that's value is the config of hazelcast.cluster-name.

Node Metrics

MetricName	Type	Labels	DESCRIPTION
cluster_info	Gauge	hazelcastVersion, the version of hazelcast. master, seatunnel master address.	Cluster info
cluster_time	Gauge	hazelcastVersion, the version of hazelcast.	Cluster time
node_count	Gauge	-	Cluster node total count
node_state	Gauge	address, server instance address,for example: "127.0.0.1:5801"	Whether is up of seatunnel node
hazelcast_executor_executedCount	Gauge	type, the type of executor, including: "async" "client" "clientBlocking" "clientQuery" "io" "offloadable" "scheduled" "system"	The hazelcast executor executedCount of seatunnel cluster node
hazelcast_executor_isShutdown	Gauge	type, the type of executor, including: "async" "client" "clientBlocking" "clientQuery" "io" "offloadable" "scheduled" "system"	The hazelcast executor isShutdown of seatunnel cluster node
hazelcast_executor_isTerminated	Gauge	type, the type of executor, including: "async" "client" "clientBlocking" "clientQuery" "io" "offloadable" "scheduled" "system"	The hazelcast executor isTerminated of seatunnel cluster node
hazelcast_executor_maxPoolSize	Gauge	type, the type of executor, including: "async" "client" "clientBlocking" "clientQuery" "io" "offloadable" "scheduled" "system"	The hazelcast executor maxPoolSize of seatunnel cluster node
hazelcast_executor_poolSize	Gauge	type, the type of executor, including: "async" "client" "clientBlocking" "clientQuery" "io" "offloadable" "scheduled" "system"	The hazelcast executor poolSize of seatunnel cluster node
hazelcast_executor_queueRemainingCapacity	Gauge	type, the type of executor, including: "async" "client" "clientBlocking" "clientQuery" "io" "offloadable" "scheduled" "system"	The hazelcast executor queueRemainingCapacity of seatunnel cluster node
hazelcast_executor_queueSize	Gauge	type, the type of executor, including: "async" "client" "clientBlocking" "clientQuery" "io" "offloadable" "scheduled" "system"	The hazelcast executor queueSize of seatunnel cluster node
hazelcast_partition_partitionCount	Gauge	-	The partitionCount of seatunnel cluster node
hazelcast_partition_activePartition	Gauge	-	The activePartition of seatunnel cluster node
hazelcast_partition_isClusterSafe	Gauge	-	Whether is cluster safe of partition
hazelcast_partition_isLocalMemberSafe	Gauge	-	Whether is local member safe of partition

Thread Pool Status

MetricName	Type	Labels	DESCRIPTION
job_thread_pool_activeCount	Gauge	address, server instance address,for example: "127.0.0.1:5801"	The activeCount of seatunnel coordinator job's executor cached thread pool
job_thread_pool_corePoolSize	Gauge	address, server instance address,for example: "127.0.0.1:5801"	The corePoolSize of seatunnel coordinator job's executor cached thread pool
job_thread_pool_maximumPoolSize	Gauge	address, server instance address,for example: "127.0.0.1:5801"	The maximumPoolSize of seatunnel coordinator job's executor cached thread pool
job_thread_pool_poolSize	Gauge	address, server instance address,for example: "127.0.0.1:5801"	The poolSize of seatunnel coordinator job's executor cached thread pool
job_thread_pool_queueTaskCount	Gauge	address, server instance address,for example: "127.0.0.1:5801"	The queueTaskCount of seatunnel coordinator job's executor cached thread pool
job_thread_pool_completedTask_total	Counter	address, server instance address,for example: "127.0.0.1:5801"	The completedTask of seatunnel coordinator job's executor cached thread pool
job_thread_pool_task_total	Counter	address, server instance address,for example: "127.0.0.1:5801"	The taskCount of seatunnel coordinator job's executor cached thread pool
job_thread_pool_rejection_total	Counter	address, server instance address,for example: "127.0.0.1:5801"	The rejectionCount of seatunnel coordinator job's executor cached thread pool

Job info detail

MetricName	Type	Labels	DESCRIPTION
job_count	Gauge	type, the type of job, including: "canceled" "cancelling" "created" "failed" "failing" "finished" "running" "scheduled"	All job counts of seatunnel cluster

JVM Metrics

MetricName	Type	Labels	DESCRIPTION
jvm_threads_current	Gauge	-	Current thread count of a JVM
jvm_threads_daemon	Gauge	-	Daemon thread count of a JVM
jvm_threads_peak	Gauge	-	Peak thread count of a JVM
jvm_threads_started_total	Counter	-	Started thread count of a JVM
jvm_threads_deadlocked	Gauge	-	Cycles of JVM-threads that are in deadlock waiting to acquire object monitors or ownable synchronizers
jvm_threads_deadlocked_monitor	Gauge	-	Cycles of JVM-threads that are in deadlock waiting to acquire object monitors
jvm_threads_state	Gauge	state, the state of jvm thread, including: "NEW" "TERMINATED" "RUNNABLE" "BLOCKED" "WAITING" "TIMED_WAITING" "UNKNOWN"	Current count of threads by state
jvm_classes_currently_loaded	Gauge	-	The number of classes that are currently loaded in the JVM
jvm_classes_loaded_total	Counter	-	The total number of classes that have been loaded since the JVM has started execution
jvm_classes_unloaded_total	Counter	-	The total number of classes that have been unloaded since the JVM has started execution
jvm_memory_pool_allocated_bytes_total	Counter	pool,including: "Code Cache" "PS Eden Space" "PS Old Ge" "PS Survivor Space" "Compressed Class Space" "Metaspace"	Total bytes allocated in a given JVM memory pool. Only updated after GC, not continuously
jvm_gc_collection_seconds_count	Summary	gc,including: "PS Scavenge" "PS MarkSweep"	Time spent in a given JVM garbage collector in seconds
jvm_gc_collection_seconds_sum	Summary	gc,including: "PS Scavenge" "PS MarkSweep"	Time spent in a given JVM garbage collector in seconds
jvm_info	Gauge	runtime, for example: "Java(TM) SE Runtime Environment". vendor, for example: "Oracle Corporation". version ,for example: "1.8.0_212-b10"	VM version info
process_cpu_seconds_total	Counter	-	Total user and system CPU time spent in seconds
process_start_time_seconds	Gauge	-	Start time of the process since unix epoch in seconds
process_open_fds	Gauge	-	Number of open file descriptors
process_max_fds	Gauge	-	Maximum number of open file descriptors
jvm_memory_objects_pending_finalization	Gauge	-	The number of objects waiting in the finalizer queue
jvm_memory_bytes_used	Gauge	area, including: "heap" "noheap"	Used bytes of a given JVM memory area
jvm_memory_bytes_committed	Gauge	area, including: "heap" "noheap"	Committed (bytes) of a given JVM memory area
jvm_memory_bytes_max	Gauge	area, including:"heap" "noheap"	Max (bytes) of a given JVM memory area
jvm_memory_bytes_init	Gauge	area, including:"heap" "noheap"	Initial bytes of a given JVM memory area
jvm_memory_pool_bytes_used	Gauge	pool, including: "Code Cache" "PS Eden Space" "PS Old Ge" "PS Survivor Space" "Compressed Class Space" "Metaspace"	Used bytes of a given JVM memory pool
jvm_memory_pool_bytes_committed	Gauge	pool, including: "Code Cache" "PS Eden Space" "PS Old Ge" "PS Survivor Space" "Compressed Class Space" "Metaspace"	Committed bytes of a given JVM memory pool
jvm_memory_pool_bytes_max	Gauge	pool, including: "Code Cache" "PS Eden Space" "PS Old Ge" "PS Survivor Space" "Compressed Class Space" "Metaspace"	Max bytes of a given JVM memory pool
jvm_memory_pool_bytes_init	Gauge	pool, including: "Code Cache" "PS Eden Space" "PS Old Ge" "PS Survivor Space" "Compressed Class Space" "Metaspace"	Initial bytes of a given JVM memory pool
jvm_memory_pool_allocated_bytes_created	Gauge	pool, including: "Code Cache" "PS Eden Space" "PS Old Ge" "PS Survivor Space" "Compressed Class Space" "Metaspace"	Total bytes allocated in a given JVM memory pool. Only updated after GC, not continuously
jvm_memory_pool_collection_used_bytes	Gauge	pool, including: "PS Eden Space" "PS Old Ge" "PS Survivor Space"	Used bytes after last collection of a given JVM memory pool
jvm_memory_pool_collection_committed_bytes	Gauge	pool, including: "PS Eden Space" "PS Old Ge" "PS Survivor Space"	Committed after last collection bytes of a given JVM memory pool
jvm_memory_pool_collection_max_bytes	Gauge	pool, including: "PS Eden Space" "PS Old Ge" "PS Survivor Space"	Max bytes after last collection of a given JVM memory pool
jvm_memory_pool_collection_init_bytes	Gauge	pool, including: "PS Eden Space" "PS Old Ge" "PS Survivor Space"	Initial after last collection bytes of a given JVM memory pool
jvm_buffer_pool_used_bytes	Gauge	pool, including: "direct" "mapped"	Used bytes of a given JVM buffer pool
jvm_buffer_pool_capacity_bytes	Gauge	pool, including: "direct" "mapped"	Bytes capacity of a given JVM buffer pool
jvm_buffer_pool_used_buffers	Gauge	pool, including: "direct" "mapped"	Used buffers of a given JVM buffer pool

Cluster Monitoring By Prometheus & Grafana

Install Prometheus

For a guide on how to set up Prometheus server go to the Installation

Configuration Prometheus

Add seatunnel instance metric exports into /etc/prometheus/prometheus.yaml. For example:

global:
  # How frequently to scrape targets from this job.
  scrape_interval: 15s
scrape_configs:
  # The job name assigned to scraped metrics by default.
  - job_name: 'seatunnel'
    scrape_interval: 5s
    # Metrics export path 
    metrics_path: /hazelcast/rest/instance/metrics
    # List of labeled statically configured targets for this job.
    static_configs:
      # The targets specified by the static config.
      - targets: [ 'localhost:5801' ]
      # Labels assigned to all metrics scraped from the targets.
      # labels: [<labelName>:<labelValue>]

Install Grafana

For a guide on how to set up Grafana server go to the Installation

Monitoring Dashboard

Add Prometheus DataSource on Grafana.
- Import Seatunnel Cluster monitoring dashboard by Dashboard JSON into Grafana.

The effect image of the dashboard

Telemetry

Metrics​

Node Metrics​

Thread Pool Status​

Job info detail​

JVM Metrics​

Cluster Monitoring By Prometheus & Grafana​

Install Prometheus​

Configuration Prometheus​

Install Grafana​

Monitoring Dashboard​