Version: Next

MySQL CDC

MySQL CDC source connector

Supported Engines

SeaTunnel Zeta
Flink

Description

The MySQL CDC connector reads both snapshot data and incremental data from a MySQL database. This document describes how to set up the MySQL CDC connector to run SQL queries against MySQL databases.

Key features

Supported DataSource Info

Datasource	Supported versions	Driver	Url	Maven
MySQL	MySQL: 5.5, 5.6, 5.7, 8.0.x RDS MySQL: 5.6, 5.7, 8.0.x	com.mysql.cj.jdbc.Driver	jdbc:mysql://localhost:3306/test	https://mvnrepository.com/artifact/mysql/mysql-connector-java/8.0.28

Using Dependency

Install Jdbc Driver

For Flink Engine

Ensure that the JDBC driver JAR has been placed in the ${SEATUNNEL_HOME}/plugins/ directory.

For SeaTunnel Zeta Engine

Ensure that the JDBC driver JAR has been placed in the ${SEATUNNEL_HOME}/lib/ directory.

Creating MySQL user

You have to define a MySQL user with appropriate permissions on all databases that the Debezium MySQL connector monitors.

Create the MySQL user:

mysql> CREATE USER 'user'@'localhost' IDENTIFIED BY 'password';

Grant the required permissions to the user:

mysql> GRANT SELECT, RELOAD, SHOW DATABASES, REPLICATION SLAVE, REPLICATION CLIENT ON *.* TO 'user'@'localhost';

Finalize the user’s permissions:

mysql> FLUSH PRIVILEGES;

Enabling the MySQL Binlog

You must enable binary logging for MySQL replication. The binary logs record transaction updates for replication tools to propagate changes.

Check whether the log-bin option is already on:

mysql> show variables where variable_name in ('log_bin', 'binlog_format', 'binlog_row_image', 'gtid_mode', 'enforce_gtid_consistency');
+--------------------------+----------------+
| Variable_name            | Value          |
+--------------------------+----------------+
| binlog_format            | ROW            |
| binlog_row_image         | FULL           |
| enforce_gtid_consistency | ON             |
| gtid_mode                | ON             |
| log_bin                  | ON             |
+--------------------------+----------------+

If the value of log_bin is not ON, add the following properties to your MySQL server configuration file ($MYSQL_HOME/mysql.cnf). The comments describe each property:

# Enable binary replication log and set the prefix, expiration, and log format.
# The prefix is arbitrary, expiration can be short for integration tests but would
# be longer on a production system. Row-level info is required for ingest to work.
# Server ID is required, but this will vary on production systems
server-id         = 223344
log_bin           = mysql-bin
expire_logs_days  = 10
binlog_format     = row
# mysql 5.6+ requires binlog_row_image to be set to FULL
binlog_row_image  = FULL

# Optional: enable GTID mode
# mysql 5.6+ requires gtid_mode to be set to ON, but not required by mysql 8.0+
gtid_mode = on
enforce_gtid_consistency = on

Restart MySQL Server

/etc/init.d/mysqld restart

Confirm your changes by checking the binlog status once more:

MySQL 5.5:

mysql> show variables where variable_name in ('log_bin', 'binlog_format', 'binlog_row_image', 'gtid_mode', 'enforce_gtid_consistency');
+--------------------------+----------------+
| Variable_name            | Value          |
+--------------------------+----------------+
| binlog_format            | ROW            |
| log_bin                  | ON             |
+--------------------------+----------------+

MySQL 5.6+:

mysql> show variables where variable_name in ('log_bin', 'binlog_format', 'binlog_row_image', 'gtid_mode', 'enforce_gtid_consistency');
+--------------------------+----------------+
| Variable_name            | Value          |
+--------------------------+----------------+
| binlog_format            | ROW            |
| binlog_row_image         | FULL           |
| enforce_gtid_consistency | ON             |
| gtid_mode                | ON             |
| log_bin                  | ON             |
+--------------------------+----------------+

MySQL 8.0+:

mysql> show variables where variable_name in ('log_bin', 'binlog_format', 'binlog_row_image', 'gtid_mode', 'enforce_gtid_consistency');
+--------------------------+----------------+
| Variable_name            | Value          |
+--------------------------+----------------+
| binlog_format            | ROW            |
| binlog_row_image         | FULL           |
| enforce_gtid_consistency | OFF            |
| gtid_mode                | OFF            |
| log_bin                  | ON             |
+--------------------------+----------------+

Notes

Setting up MySQL session timeouts

When an initial consistent snapshot is made for large databases, your established connection could time out while the tables are being read. You can prevent this by configuring interactive_timeout and wait_timeout in your MySQL configuration file.

interactive_timeout: The number of seconds the server waits for activity on an interactive connection before closing it. See MySQL’s documentation for more details.
wait_timeout: The number of seconds the server waits for activity on a non-interactive connection before closing it. See MySQL’s documentation for more details.

For more database settings, see the Debezium MySQL Connector documentation.

Data Type Mapping

MySQL Data Type	SeaTunnel Data Type
BIT(1), TINYINT(1)	BOOLEAN
TINYINT	TINYINT
TINYINT UNSIGNED, SMALLINT	SMALLINT
SMALLINT UNSIGNED, MEDIUMINT, MEDIUMINT UNSIGNED, INT, INTEGER, YEAR	INT
INT UNSIGNED, INTEGER UNSIGNED, BIGINT	BIGINT
BIGINT UNSIGNED	DECIMAL(20,0)
DECIMAL(p, s), DECIMAL(p, s) UNSIGNED, NUMERIC(p, s), NUMERIC(p, s) UNSIGNED	DECIMAL(p,s)
FLOAT, FLOAT UNSIGNED	FLOAT
DOUBLE, DOUBLE UNSIGNED, REAL, REAL UNSIGNED	DOUBLE
CHAR, VARCHAR, TINYTEXT, MEDIUMTEXT, TEXT, LONGTEXT, ENUM, JSON	STRING
DATE	DATE
TIME(s)	TIME(s)
DATETIME, TIMESTAMP(s)	TIMESTAMP(s)
BINARY, VARBINARY, BIT(p), TINYBLOB, MEDIUMBLOB, BLOB, LONGBLOB, GEOMETRY	BYTES

Source Options

Name	Type	Required	Default	Description
url	String	Yes	-	The URL of the JDBC connection, for example: `jdbc:mysql://localhost:3306/test`.
username	String	Yes	-	Name of the database user to use when connecting to the database server.
password	String	Yes	-	Password to use when connecting to the database server.
database-names	List	No	-	Names of the databases to monitor.
database-pattern	String	No	.*	Regular expression matching the names of the databases to capture, for example: `database_prefix.*`.
table-names	List	Yes	-	Names of the tables to monitor. Each table name must include the database name, for example: `database_name.table_name`. Mutually exclusive with `table-pattern`.
table-pattern	String	Yes	-	Regular expression matching the names of the tables to capture. The pattern must include the database name, for example: `database.*\\.table_.*`. Mutually exclusive with `table-names`.
table-names-config	List	No	-	Table config list, for example: [{"table": "db1.schema1.table1","primaryKeys": ["key1"],"snapshotSplitColumn": "key2"}]
startup.mode	Enum	No	INITIAL	Optional startup mode for the MySQL CDC consumer. Valid values are `initial`, `earliest`, `latest`, `specific` and `timestamp`. `initial`: synchronize historical data at startup, then synchronize incremental data. `earliest`: start from the earliest offset possible. `latest`: start from the latest offset. `specific`: start from user-supplied specific offsets. `timestamp`: start from a user-supplied timestamp.
startup.specific-offset.file	String	No	-	Start from the specified binlog file name. Note: this option is required when `startup.mode` is `specific`.
startup.specific-offset.pos	Long	No	-	Start from the specified binlog file position. Note: this option is required when `startup.mode` is `specific`.
startup.timestamp	Long	No	-	Start from the specified timestamp. Note: this option is required when `startup.mode` is `timestamp`.
stop.mode	Enum	No	NEVER	Optional stop mode for the MySQL CDC consumer. Valid values are `never`, `latest` and `specific`. `never`: a real-time job never stops the source. `latest`: stop at the latest offset. `specific`: stop at a user-supplied specific offset.
stop.specific-offset.file	String	No	-	Stop at the specified binlog file name. Note: this option is required when `stop.mode` is `specific`.
stop.specific-offset.pos	Long	No	-	Stop at the specified binlog file position. Note: this option is required when `stop.mode` is `specific`.
snapshot.split.size	Integer	No	8096	The split size (number of rows) of a table snapshot. Captured tables are divided into multiple splits when the table snapshot is read.
snapshot.fetch.size	Integer	No	1024	The maximum fetch size per poll when reading a table snapshot.
server-id	String	No	-	A numeric ID or a numeric ID range for this database client. The numeric ID syntax is like `5400`; the numeric ID range syntax is like `5400-5408`. Every ID must be unique across all currently-running database processes in the MySQL cluster. This connector joins the MySQL cluster as another server (with this unique ID) so it can read the binlog. By default, a random number is generated between 6500 and 2,148,492,146, though we recommend setting an explicit value.
server-time-zone	String	No	UTC	The session time zone in database server. If not set, then ZoneId.systemDefault() is used to determine the server time zone.
connect.timeout.ms	Duration	No	30000	The maximum time that the connector should wait after trying to connect to the database server before timing out.
connect.max-retries	Integer	No	3	The maximum number of times the connector retries establishing a connection to the database server.
connection.pool.size	Integer	No	20	The JDBC connection pool size.
chunk-key.even-distribution.factor.upper-bound	Double	No	100	The upper bound of the chunk key distribution factor. This factor is used to determine whether the table data is evenly distributed. If the distribution factor is calculated to be less than or equal to this upper bound (i.e., (MAX(id) - MIN(id) + 1) / row count), the table chunks would be optimized for even distribution. Otherwise, if the distribution factor is greater, the table will be considered as unevenly distributed and the sampling-based sharding strategy will be used if the estimated shard count exceeds the value specified by `sample-sharding.threshold`. The default value is 100.0.
chunk-key.even-distribution.factor.lower-bound	Double	No	0.05	The lower bound of the chunk key distribution factor. This factor is used to determine whether the table data is evenly distributed. If the distribution factor is calculated to be greater than or equal to this lower bound (i.e., (MAX(id) - MIN(id) + 1) / row count), the table chunks would be optimized for even distribution. Otherwise, if the distribution factor is less, the table will be considered as unevenly distributed and the sampling-based sharding strategy will be used if the estimated shard count exceeds the value specified by `sample-sharding.threshold`. The default value is 0.05.
sample-sharding.threshold	Integer	No	1000	This configuration specifies the threshold of estimated shard count to trigger the sample sharding strategy. When the distribution factor is outside the bounds specified by `chunk-key.even-distribution.factor.upper-bound` and `chunk-key.even-distribution.factor.lower-bound`, and the estimated shard count (calculated as approximate row count / chunk size) exceeds this threshold, the sample sharding strategy will be used. This can help to handle large datasets more efficiently. The default value is 1000 shards.
inverse-sampling.rate	Integer	No	1000	The inverse of the sampling rate used in the sample sharding strategy. For example, if this value is set to 1000, it means a 1/1000 sampling rate is applied during the sampling process. This option provides flexibility in controlling the granularity of the sampling, thus affecting the final number of shards. It's especially useful when dealing with very large datasets where a lower sampling rate is preferred. The default value is 1000.
exactly_once	Boolean	No	false	Enable exactly-once semantics.
format	Enum	No	DEFAULT	Optional output format for MySQL CDC. Valid values are `DEFAULT` and `COMPATIBLE_DEBEZIUM_JSON`.
schema-changes.enabled	Boolean	No	false	Schema evolution is disabled by default. Currently only `add column`, `drop column`, `rename column` and `modify column` are supported.
debezium	Config	No	-	Pass-through Debezium's properties to Debezium Embedded Engine which is used to capture data changes from MySQL server.
int_type_narrowing	Boolean	No	true	If true, the TINYINT(1) type is narrowed to the BOOLEAN type when this causes no loss of precision. Currently supported for MySQL only. See `int_type_narrowing` below.
common-options		No	-	Source plugin common parameters. See Source Common Options for details.
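
As a minimal sketch of how the split-related options interact, the fragment below writes out their documented defaults and walks through the distribution-factor arithmetic in comments. The two tables and their key ranges are hypothetical, and reading "chunk size" as `snapshot.split.size` is an assumption rather than something this document states.

```hocon
source {
  MySQL-CDC {
    url = "jdbc:mysql://localhost:3306/testdb"
    username = "root"
    password = "root@123"
    table-names = ["testdb.table1"]

    # Documented defaults, written out explicitly.
    snapshot.split.size = 8096
    chunk-key.even-distribution.factor.lower-bound = 0.05
    chunk-key.even-distribution.factor.upper-bound = 100
    sample-sharding.threshold = 1000
    inverse-sampling.rate = 1000

    # Hypothetical table A: MIN(id) = 1, MAX(id) = 50000000, 10000000 rows.
    #   distribution factor = (50000000 - 1 + 1) / 10000000 = 5.0
    #   0.05 <= 5.0 <= 100, so chunks are optimized for even distribution.
    #
    # Hypothetical table B: MIN(id) = 1, MAX(id) = 2000000000, 10000000 rows.
    #   distribution factor = (2000000000 - 1 + 1) / 10000000 = 200.0
    #   200.0 > 100, so the table is treated as unevenly distributed.
    #   estimated shard count = 10000000 / 8096, roughly 1235
    #   (assuming chunk size means snapshot.split.size)
    #   1235 > 1000, so sample sharding is used, sampling 1 row in 1000.
  }
}
```

Raising `sample-sharding.threshold` or `snapshot.split.size` in a case like table B would keep the estimated shard count under the threshold, so the sampling strategy would not be triggered.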

int_type_narrowing

If true, the TINYINT(1) type is narrowed to the BOOLEAN type when this causes no loss of precision. Currently supported for MySQL only.

For example:

int_type_narrowing = true

MySQL	SeaTunnel
TINYINT(1)	Boolean

int_type_narrowing = false

MySQL	SeaTunnel
TINYINT(1)	TINYINT
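
A short sketch of where the option sits in a job config, assuming the same connection settings as the task examples in this document. The table name is a placeholder.

```hocon
source {
  MySQL-CDC {
    url = "jdbc:mysql://localhost:3306/testdb"
    username = "root"
    password = "root@123"
    table-names = ["testdb.table1"]

    # Keep TINYINT(1) columns as TINYINT instead of narrowing them to BOOLEAN.
    int_type_narrowing = false
  }
}
```

This may be useful when a TINYINT(1) column stores values other than 0 and 1.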

Task Example

Simple

Support multi-table reading

env {
  parallelism = 1
  job.mode = "STREAMING"
  checkpoint.interval = 10000
}

source {
  MySQL-CDC {
    url = "jdbc:mysql://localhost:3306/testdb"
    username = "root"
    password = "root@123"
    table-names = ["testdb.table1", "testdb.table2"]
    
    startup.mode = "initial"
  }
}

sink {
  Console {
  }
}
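
To resume from a known binlog position instead of taking an initial snapshot, the `startup.mode = "specific"` options from the Source Options table can be combined roughly as follows. This is a sketch only: the binlog file name and position are hypothetical values, chosen to match the `mysql-bin` prefix used in the server configuration earlier in this document.

```hocon
env {
  parallelism = 1
  job.mode = "STREAMING"
  checkpoint.interval = 10000
}

source {
  MySQL-CDC {
    url = "jdbc:mysql://localhost:3306/testdb"
    username = "root"
    password = "root@123"
    table-names = ["testdb.table1", "testdb.table2"]

    # Skip the snapshot phase and start reading the binlog at a given offset.
    startup.mode = "specific"
    # Both options below are required when startup.mode is "specific".
    # Hypothetical values; replace them with a file and position from your server.
    startup.specific-offset.file = "mysql-bin.000003"
    startup.specific-offset.pos = 4
  }
}

sink {
  Console {
  }
}
```

The available binlog files on a server can be listed with the MySQL statement `SHOW BINARY LOGS`.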

Support sending debezium-compatible format to Kafka

Must be used with the Kafka connector sink. See the compatible Debezium format documentation for details.
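
A minimal sketch of such a job is shown below. The source side uses only options listed in the Source Options table. The Kafka sink option names (`bootstrap.servers`, `topic`, `format`), the broker address, and the topic name are assumptions for illustration and should be checked against the Kafka connector documentation.

```hocon
env {
  parallelism = 1
  job.mode = "STREAMING"
  checkpoint.interval = 10000
}

source {
  MySQL-CDC {
    url = "jdbc:mysql://localhost:3306/testdb"
    username = "root"
    password = "root@123"
    table-names = ["testdb.table1", "testdb.table2"]

    # Emit change events in the Debezium-compatible JSON format.
    format = "COMPATIBLE_DEBEZIUM_JSON"
  }
}

sink {
  Kafka {
    # Assumed sink options; broker address and topic name are placeholders.
    bootstrap.servers = "localhost:9092"
    topic = "mysql_cdc_events"
    format = "compatible_debezium_json"
  }
}
```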

Support custom primary key for table

env {
  parallelism = 1
  job.mode = "STREAMING"
  checkpoint.interval = 10000
}

source {
  MySQL-CDC {
    url = "jdbc:mysql://localhost:3306/testdb"
    username = "root"
    password = "root@123"
    
    table-names = ["testdb.table1", "testdb.table2"]
    table-names-config = [
      {
        table = "testdb.table2"
        primaryKeys = ["id"]
      }
    ]
  }
}

sink {
  Console {
  }
}

Support schema evolution

env {
  # You can set engine configuration here
  parallelism = 5
  job.mode = "STREAMING"
  checkpoint.interval = 5000
  read_limit.bytes_per_second=7000000
  read_limit.rows_per_second=400
}

source {
  MySQL-CDC {
    server-id = 5652-5657
    username = "st_user_source"
    password = "mysqlpw"
    table-names = ["shop.products"]
    url = "jdbc:mysql://mysql_cdc_e2e:3306/shop"
    
    schema-changes.enabled = true
  }
}

sink {
  jdbc {
    url = "jdbc:mysql://mysql_cdc_e2e:3306/shop"
    driver = "com.mysql.cj.jdbc.Driver"
    user = "st_user_sink"
    password = "mysqlpw"
    generate_sink_sql = true
    database = shop
    table = mysql_cdc_e2e_sink_table_with_schema_change_exactly_once
    primary_keys = ["id"]
    is_exactly_once = true
    xa_data_source_class_name = "com.mysql.cj.jdbc.MysqlXADataSource"
  }
}

Support table-pattern for multi-table reading

table-pattern and table-names are mutually exclusive.

env {
  # You can set engine configuration here
  parallelism = 1
  job.mode = "STREAMING"
  checkpoint.interval = 5000
  read_limit.bytes_per_second=7000000
  read_limit.rows_per_second=400
}

source {
  MySQL-CDC {
    server-id = 5652
    username = "st_user_source"
    password = "mysqlpw"
    database-pattern = "source.*"
    table-pattern = "source.*\\..*"
    url = "jdbc:mysql://mysql_cdc_e2e:3306"
  }
}

sink {
  Console {
  }
}

Changelog

Change	Commit	Version
[Feature][MySQL CDC] MySQL cdc support start by time (#9735)	https://github.com/apache/seatunnel/commit/b6c5d941b0	2.3.12
[Feature][Core] Add plugin directory support for each connector (#9650)	https://github.com/apache/seatunnel/commit/4beb2b9336	2.3.12
[Feature][Connectors-v2] Support Mysql8.4+ for mysql-cdc (#9720)	https://github.com/apache/seatunnel/commit/e338743927	2.3.12
[improve] jdbc options (#9541)	https://github.com/apache/seatunnel/commit/d041e5fb32	2.3.12
[Feature][Connectors-v2] Optimize the size of CDC JAR Files (#9546)	https://github.com/apache/seatunnel/commit/1dd19c6823	2.3.12
[Feature][Connector-V2] Jdbc mysql support read tinyint(1) to byte(tinyint) (#9373)	https://github.com/apache/seatunnel/commit/7b87aa6f12	2.3.12
[Improve][CDC] Filter ddl for snapshot phase (#8911)	https://github.com/apache/seatunnel/commit/641cc72f2f	2.3.10
[Improve][CDC] Extract duplicate code (#8906)	https://github.com/apache/seatunnel/commit/b922bb90e6	2.3.10
[Improve] restruct connector common options (#8634)	https://github.com/apache/seatunnel/commit/f3499a6eeb	2.3.10
[Fix][mysql-cdc] Fix GTIDs on startup to correctly recover from checkpoint (#8528)	https://github.com/apache/seatunnel/commit/82e4096c08	2.3.10
[Feature][MySQL-CDC] Support database/table wildcards scan read (#8323)	https://github.com/apache/seatunnel/commit/2116843ce8	2.3.9
[Feature][Jdbc] Support sink ddl for postgresql (#8276)	https://github.com/apache/seatunnel/commit/353bbd21a1	2.3.9
[Feature][CDC] Add 'schema-changes.enabled' options (#8285)	https://github.com/apache/seatunnel/commit/8e29ecf54f	2.3.9
Revert "[Feature][Redis] Flush data when the time reaches checkpoint interval" and "[Feature][CDC] Add 'schema-changes.enabled' options" (#8278)	https://github.com/apache/seatunnel/commit/fcb2938286	2.3.9
[Feature][CDC] Add 'schema-changes.enabled' options (#8252)	https://github.com/apache/seatunnel/commit/d783f9447c	2.3.9
[Improve][dist]add shade check rule (#8136)	https://github.com/apache/seatunnel/commit/51ef800016	2.3.9
[Feature][Connector-V2]Jdbc chunk split add snapshotSplitColumn config #7794 (#7840)	https://github.com/apache/seatunnel/commit/b6c6dc0438	2.3.9
[Feature][Core] Support cdc task ddl restore for zeta (#7463)	https://github.com/apache/seatunnel/commit/8e322281ed	2.3.9
[Feature][Connector-v2] Support schema evolution for Oracle connector (#7908)	https://github.com/apache/seatunnel/commit/79406bcc2f	2.3.9
[Hotfix][CDC] Fix ddl duplicate execution error when config multi_table_sink_replica (#7634)	https://github.com/apache/seatunnel/commit/23ab3edbbb	2.3.8
[Hotfix][CDC] Fix package name spelling mistake (#7415)	https://github.com/apache/seatunnel/commit/469112fa64	2.3.8
[Hotfix][MySQL-CDC] Fix ArrayIndexOutOfBoundsException in mysql binlog read (#7381)	https://github.com/apache/seatunnel/commit/40c5f313eb	2.3.7
[Improve][Connector-V2] Support schema evolution for mysql-cdc and mysql-jdbc (#6929)	https://github.com/apache/seatunnel/commit/cf91e51fc7	2.3.6
[Hotfix][MySQL-CDC] Fix read gbk varchar chinese garbled characters (#7046)	https://github.com/apache/seatunnel/commit/4e4d2b8ee5	2.3.6
[Improve][CDC] Bump the version of debezium to 1.9.8.Final (#6740)	https://github.com/apache/seatunnel/commit/c3ac953524	2.3.6
[Improve][CDC] Close idle subtasks gorup(reader/writer) in increment phase (#6526)	https://github.com/apache/seatunnel/commit/454c339b9c	2.3.6
[Improve][JDBC Source] Fix Split can not be cancel (#6825)	https://github.com/apache/seatunnel/commit/ee3b7c3723	2.3.6
[Hotfix][Jdbc/CDC] Fix postgresql uuid type in jdbc read (#6684)	https://github.com/apache/seatunnel/commit/868ba4d7c7	2.3.6
[Improve][mysql-cdc] Support mysql 5.5 versions (#6710)	https://github.com/apache/seatunnel/commit/058f5594a3	2.3.6
[Improve][mysql-cdc] Fallback to desc table when show create table failed (#6701)	https://github.com/apache/seatunnel/commit/6f74663c08	2.3.6
[Improve][Jdbc] Add quote identifier for sql (#6669)	https://github.com/apache/seatunnel/commit/849d748d3d	2.3.5
[Fix][Connector-V2] Fix connector support SPI but without no args constructor (#6551)	https://github.com/apache/seatunnel/commit/5f3c9c36a5	2.3.5
[Improve][CDC-Connector]Fix CDC option rule. (#6454)	https://github.com/apache/seatunnel/commit/1ea27afa87	2.3.5
[Improve][CDC] Optimize memory allocation for snapshot split reading (#6281)	https://github.com/apache/seatunnel/commit/4856645837	2.3.5
[Improve][API] Unify type system api(data & type) (#5872)	https://github.com/apache/seatunnel/commit/b38c7edcc9	2.3.5
[Feature][CDC] Support custom table primary key (#6106)	https://github.com/apache/seatunnel/commit/1312a1dd27	2.3.4
[Feature][CDC] Support read no primary key table (#6098)	https://github.com/apache/seatunnel/commit/b42d78de3f	2.3.4
[Bug][CDC] Fix state recovery error when switching a single table to multiple tables (#5784)	https://github.com/apache/seatunnel/commit/37fcff347e	2.3.4
[Feature][formats][ogg] Support read ogg format message #4201 (#4225)	https://github.com/apache/seatunnel/commit/7728e241e8	2.3.4
[Improve][CDC] Clean unused code (#5785)	https://github.com/apache/seatunnel/commit/b5a66d3dbe	2.3.4
[Improve][Jdbc] Fix database identifier (#5756)	https://github.com/apache/seatunnel/commit/dbfc8a670a	2.3.4
[improve][mysql-cdc] Optimize the default value range of mysql server-id to reduce conflicts. (#5550)	https://github.com/apache/seatunnel/commit/5174639463	2.3.4
[Improve] Remove catalog tag for config file (#5645)	https://github.com/apache/seatunnel/commit/dc509aa080	2.3.4
[Improve][Pom] Add junit4 to the root pom (#5611)	https://github.com/apache/seatunnel/commit/7b4f7db2a2	2.3.4
[Improve] Refactor CatalogTable and add `SeaTunnelSource::getProducedCatalogTables` (#5562)	https://github.com/apache/seatunnel/commit/41173357f8	2.3.4
[Improve][connector-cdc-mysql] avoid listing tables under unnecessary databases (#5365)	https://github.com/apache/seatunnel/commit/3e5d018b35	2.3.4
[Improve][Docs] Refactor MySQL-CDC docs (#5302)	https://github.com/apache/seatunnel/commit/74530a0461	2.3.4
[Improve][CheckStyle] Remove useless 'SuppressWarnings' annotation of checkstyle. (#5260)	https://github.com/apache/seatunnel/commit/51c0d709ba	2.3.4
[Hotfix] Fix com.google.common.base.Preconditions to seatunnel shade one (#5284)	https://github.com/apache/seatunnel/commit/ed5eadcf73	2.3.3
[Imporve][CDC Base] Add a fast sampling method that supports character types (#5179)	https://github.com/apache/seatunnel/commit/c0422dbfeb	2.3.3
[improve][CDC Base] Add some split parameters to the optionRule (#5161)	https://github.com/apache/seatunnel/commit/94fd6755e6	2.3.3
[Improve][CDC] support exactly-once of cdc and fix the BinlogOffset comparing bug (#5057)	https://github.com/apache/seatunnel/commit/0e4190ab2e	2.3.3
[Feature][Connector-V2][CDC] Support string type shard fields. (#5147)	https://github.com/apache/seatunnel/commit/e1be9d7f8a	2.3.3
[Feature][CDC] Support tables without primary keys (with unique keys) (#163) (#5150)	https://github.com/apache/seatunnel/commit/32b7f2b690	2.3.3
[Feature][Connector-V2][mysql cdc] Conversion of tinyint(1) to bool is supported (#5105)	https://github.com/apache/seatunnel/commit/86b1b7e31a	2.3.3
[Feature][connector-v2][mongodbcdc]Support source mongodb cdc (#4923)	https://github.com/apache/seatunnel/commit/d729fcba4c	2.3.3
[Bugfix][connector-cdc-mysql] Fix listener not released when BinlogClient reuse (#5011)	https://github.com/apache/seatunnel/commit/3287b1d852	2.3.3
[BugFix][Connector-V2] [MySQL-CDC] serverId from int to long (#5033) (#5035)	https://github.com/apache/seatunnel/commit/4abc80e111	2.3.3
[Hotfix][CDC] Fix jdbc connection leak for mysql (#5037)	https://github.com/apache/seatunnel/commit/738925ba10	2.3.3
[Feature][CDC] Support disable/enable exactly once for INITIAL (#4921)	https://github.com/apache/seatunnel/commit/6d9a3e5957	2.3.3
[Improve][CDC]change driver scope to provider (#5002)	https://github.com/apache/seatunnel/commit/745c0b9e92	2.3.3
[Improve][CDC]Remove driver for cdc connector (#4952)	https://github.com/apache/seatunnel/commit/b65f40c3c9	2.3.3
[improve][CDC base] Implement Sample-based Sharding Strategy with Configurable Sampling Rate (#4856)	https://github.com/apache/seatunnel/commit/d827c700f0	2.3.2
[Hotfix][CDC] Fix chunk start/end parameter type error (#4777)	https://github.com/apache/seatunnel/commit/c13c031995	2.3.2
[feature][catalog] Support for multiplexing connections (#4550)	https://github.com/apache/seatunnel/commit/41277d7f78	2.3.2
[BugFix][Mysql-CDC] Fix Time data type is empty when reading from MySQL CDC (#4670)	https://github.com/apache/seatunnel/commit/e4f973daf7	2.3.2
[Improve][CDC] Optimize jdbc fetch-size options (#4352)	https://github.com/apache/seatunnel/commit/fbb60ce1be	2.3.1
[Improve][CDC] Improve startup.mode/stop.mode options (#4360)	https://github.com/apache/seatunnel/commit/b71d8739d5	2.3.1
Update CDC StartupMode and StopMode option to SingleChoiceOption (#4357)	https://github.com/apache/seatunnel/commit/f60ac1a5e9	2.3.1
[bugfix][cdc-base] Fix cdc base shutdown thread not cleared (#4327)	https://github.com/apache/seatunnel/commit/ac61409bd8	2.3.1
[Feature][CDC] Support export debezium-json format to kafka (#4339)	https://github.com/apache/seatunnel/commit/5817ec07bf	2.3.1
[Improve][CDC][MySQL] Ennable binlog watermark compare (#4293)	https://github.com/apache/seatunnel/commit/b22fb259c8	2.3.1
[Feature][CDC][Mysql] Support read database list (#4255)	https://github.com/apache/seatunnel/commit/3ca60c6fed	2.3.1
Add redshift datatype convertor (#4245)	https://github.com/apache/seatunnel/commit/b19011517f	2.3.1
[improve][zeta] fix zeta bugs	https://github.com/apache/seatunnel/commit/3a82e8b39f	2.3.1
[Improve] Support MySqlCatalog Use JDBC URL With Custom Suffix	https://github.com/apache/seatunnel/commit/210d0ff1f8	2.3.1
[chore] Code format with spotless plugin.	https://github.com/apache/seatunnel/commit/291214ad6f	2.3.1
Merge branch 'dev' into merge/cdc	https://github.com/apache/seatunnel/commit/4324ee1912	2.3.1
[Improve][Project] Code format with spotless plugin.	https://github.com/apache/seatunnel/commit/423b583038	2.3.1
[improve][jdbc] Reduce jdbc options configuration (#4218)	https://github.com/apache/seatunnel/commit/ddd8f808b5	2.3.1
[improve][cdc] support sharding-tables (#4207)	https://github.com/apache/seatunnel/commit/5c3f0c9b00	2.3.1
[Hotfix][CDC] Fix multiple-table data read (#4200)	https://github.com/apache/seatunnel/commit/7f5671d2ce	2.3.1
[Feature][Zeta] Support shuffle multiple rows by tableId (#4147)	https://github.com/apache/seatunnel/commit/8348f1a108	2.3.1
[Improve][build] Give the maven module a human readable name (#4114)	https://github.com/apache/seatunnel/commit/d7cd601051	2.3.1
[Feature][CDC] Support batch processing on multiple-table shuffle flow (#4116)	https://github.com/apache/seatunnel/commit/919653d83e	2.3.1
[Improve][Project] Code format with spotless plugin. (#4101)	https://github.com/apache/seatunnel/commit/a2ab166561	2.3.1
[Feature][CDC] MySQL CDC supports deserialization of multi-tables (#4067)	https://github.com/apache/seatunnel/commit/21ef45fcca	2.3.1
fix cdc option rule error (#4018)	https://github.com/apache/seatunnel/commit/ea160429df	2.3.1
[Improve][CDC][base] Guaranteed to be exactly-once in the process of switching from SnapshotTask to IncrementalTask (#3837)	https://github.com/apache/seatunnel/commit/8379aaf876	2.3.1
[Feature][Connector] add get source method to all source connector (#3846)	https://github.com/apache/seatunnel/commit/417178fb84	2.3.1
[Feature][API & Connector & Doc] add parallelism and column projection interface (#3829)	https://github.com/apache/seatunnel/commit/b9164b8ba1	2.3.1
[Improve][CDC] Add mysql-cdc source factory (#3791)	https://github.com/apache/seatunnel/commit/356538de8a	2.3.1
[feature][connector-v2] add sqlServer CDC (#3686)	https://github.com/apache/seatunnel/commit/0f0afb58af	2.3.0
[feature][e2e][cdc] add mysql cdc container (#3667)	https://github.com/apache/seatunnel/commit/7696ba1551	2.3.0
[feature][cdc] Fixed error in mysql cdc under real-time job (#3666)	https://github.com/apache/seatunnel/commit/2238fda300	2.3.0
[feature][connector][cdc] add SeaTunnelRowDebeziumDeserializeSchema (#3499)	https://github.com/apache/seatunnel/commit/ff44db116e	2.3.0
[feature][connector][mysql-cdc] add MySQL CDC enumerator (#3481)	https://github.com/apache/seatunnel/commit/ff4b32dc28	2.3.0
[bugfix][connector-v2] fix cdc mysql reader err (#3465)	https://github.com/apache/seatunnel/commit/1b406b5a31	2.3.0
[feature][connector] add mysql cdc reader (#3455)	https://github.com/apache/seatunnel/commit/ae981df675	2.3.0
