Modern Data Lake on AWS for CBR Sciences

Share this article:

DataGrokr built a modern data lake for CBR Sciences, a leading marketing analytics consulting organization that specializes in building models of customer behavior. The data lake ingested data from various third-party providers providing billions of transaction data and has a size of 45TB. The complete cloud infrastructure was built up by DataGrokr to PCI compliant security standards. The data lake uses various AWS data services for transforming and storing the data. Gitlab CI/CD pipelines are used for code deployment and orchestrating the data pipelines.

About the Client

Our client CBR Sciences, a leading marketing analytics consulting organization that specializes in building models of customer behavior. They acquire data from a variety of public and proprietary data sources to create customized models to serve the needs of online businesses. They help level the playing field allowing small business to compete with larger enterprises.

Client’s need and Problem statement

The Client CBR Sciences, was looking to build a scalable and cost-effective data platform that would allow them to ingest data from multiple data sources, process and store the data. The platform had to be highly secure and needed to meet PCI security standards. They also wanted the platform to have modern ML Ops capabilities and facilitate data scientists and DevOps professionals to interact in a seamless fashion to enable shorter deployment time for their models. They were looking for a solution provider who not only had expertise in Data Engineering but also who could build their cloud environment from scratch and help them adopt ML Ops principles.

Tech Stack

Our solution and outcomes

Scroll to Top