What is Apache Gobblin?
Apache Gobblin is a distributed data integration framework that simplifies common aspects of big data integration, such as data ingestion, replication, organization, and lifecycle management, for both streaming and batch data ecosystems. It is open source and free to use under the Apache License 2.0.