Skip to content

Spark 3.0: First hands-on approach with Adaptive Query Execution (Part 1)

Apache Spark is a distributed data processing framework that is suitable for any Big Data context thanks to its features. Despite being a relatively recent product (the first open-source BSD license was
Read More

The world is real-time, not batch – White Paper

WHITE PAPERTHE WORLD IS REAL TIME  NOT BATCH An overview of Data Streaming scenario, its stages of evolution and benefits. Are you getting your data fast enough? Why is streaming data
Read More

AWS Partnership

We are proud to announce that we have been recognized “AWS Select Consulting Partner” within the Amazon Partner Network (APN). The Select status achievement demonstrates our commitment in delivering top technology
Read More

How to create an Apache Spark 3.0 development cluster on a single machine using Docker

Apache Spark is the most widely used in-memory parallel distributed processing framework in the field of Big Data advanced analytics. The main reasons for its success are the simplicity of use
Read More