Building Data Pipelines with Open Source Cloud Services

Following the first wave of cloud adoption, moving from private data centers and on-premises hosting to cloud infrastructure platforms such as AWS and GCP, we're now seeing a second wave of adoption w …
Talk Title | Building Data Pipelines with Open Source Cloud Services |
Speakers | Heikki Nousiainen (CTO, Aiven) |
Conference | Automotive Linux Summit & Open Source Summit Japan |
Conf Tag | |
Location | Tokyo, Japan |
Date | Jun 19-22, 2018 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
Following the first wave of cloud adoption, moving from private data centers and on-premises hosting to cloud infrastructure platforms such as AWS and GCP, we’re now seeing a second wave of adoption where self-managed and operated database and messaging services are being replaced by managed cloud database services like RDS. Many of the most popular and longstanding Open Source database systems such as PostgreSQL and MySQL are available in these Database-as-a-Service platforms. However, newer technologies used for building scalable data pipelines are commonly proprietary, locking you into a single vendor and out of the vast Open Source innovation ecosystem. We’ll look into available Open Source cloud alternatives, many of which provide superior performance without vendor lock-in, and how they can be used to build data pipelines.