Automated Pipeline for Statistics and Advanced Analytics

November 03, 2022

Statistics Netherlands (CBS) is considered to be one of the most advanced national statistical offices in the world. And the secret behind their modern approach to statistical analytics?

Automation. . .  Containerization. . .  &  Greenplum!

If you are attending VMware Explore Europe, then you don't want to miss session #DOSB3081EUR as Martin Visser (VMware) and Sander Janssen (CBS) describe how CBS manages a fully virtualized VMware Greenplum environment and how CBS built a fully automated statistical processing pipeline to speed up the delivery of the Netherlands statistics.

 

Automate a Statistical Analytics Pipeline with Greenplum and Containers [DOSB3081EUR]

Recently a team of five international experts concluded that Statistics Netherlands (CBS) is one of the most advanced national statistical offices in the world. In this session, find out how Statistics Netherlands(CBS) manages a fully virtualized VMware Greenplum environment and how CBS built a fully automated statistical processing pipeline to speed up the delivery of the Netherlands statistics. As one of the leading statistical agencies in the world, CBS process hundreds of datasets every day. The statistical data is processed in parallel using highly complex R code. This code is executed in custom-built containers next to the data in VMware Greenplum. The containers are delivered to the Greenplum platform through a fully automated build pipeline run on Microsoft DevOps and Ansible automation. 

Martin Visser, Data Specialist, VMware 
Sander Janssen, IT Infrastructure Specialist, Centraal Bureau Voor De Statistiek 

Thursday, Nov 10  09:00 AM - 10:00 CET 
Fira Barcelona Gran Via - Hall 8.0, Room 16

Filter Tags

Automation AI/ML Application Acceleration Modern Applications Blog Announcement Databases Overview