data source for real time ingestion pipeline poc -- fork of https://github.com/DoTrongAnh/scala-data-stream-test.git