GitHub - MTSWebServices/syncmaster: No-code ETL tool, based on onETL + PySpark

What is Data.SyncMaster?

Data.SyncMaster is a no-code ETL tool for transferring data between databases and file systems. Currently supported connections:

  • Apache Hive
  • Clickhouse
  • Iceberg (REST Catalog + S3)
  • MSSQL
  • MySQL
  • Oracle
  • Postgres
  • FTP
  • FTPS
  • HDFS
  • S3
  • Samba
  • SFTP
  • WebDAV

Based on onETL and Apache Spark.

Goals

  • Make transferring data between databases and file systems as simple as possible
  • Provide many built-in connectors for transferring data in heterogeneous environments
  • Support RBAC and multitenancy

Non-goals

  • No streaming support for now, only batch operations
  • This is not a change data capture (CDC) solution

Documentation

See https://syncmaster.readthedocs.io