Spark in Industry – Webinar 15 maggio 2020, 9.30am

Quali sono i framework tipici del mondo Big Data? Perchè Spark è il più diffuso e come viene adottato in contesti industriali? Quali sono gli use case più significativi?

Queste sono solo alcune delle domande a cui risponderanno Paolo Platter, Agile Lab Founder & CTO e Domenico Potena, Professore Associato del Dipartimento di Ingegneria dell’Informazione dell’Università Politecnica delle Marche, durante il webinar organizzato per il prossimo 15 maggio 2020, alle ore 9,30.

AGENDA: (9,30 am – 11,30 am)

– Lecture su Spark: concetti base, batch, Streaming & Machine Learning
– Scenari reali di applicazioni in produzione
– Q&A

Clicca qui per registrarti

Ti aspettiamo!

Big data pipeline with k8s & Deep learning for NLP – On-Line Meetup, 28th April 2020 | 7pm

Are you interested in Big data pipeline with k8s & Deep learning for NLP?

If you want to discover more, don’t miss our next meetup, with two talk:

  • “How to run big data pipeline in production with k8s”
    In several big data projects we use k8s to guarantee reliability and to automatically manage application failures.
    In this talk we show you how we used k8s in conjunction with Cloudera to deliver our mission critical applications in big data production 24/7. Speaker: Matteo Bovetti + Carlo Ventrella, Site Reliability Engineers @ Agile Lab
  • “NLP e Deep Learning: dal Information retrieval a BERT”
    Quali sono le difficoltà intrinseche e le limitazioni nella gestione dell’informazione testuale. Storia delle tecnologie NLP dai sistemi di information retrieval a BERT, modello SOTA per la maggior parte dei task affrontati dal NLP moderna. Presentazione di un caso d’uso reale in ambito chatbot e implicazioni dell’utilizzo del Deep Learning in ambito NLP. Speaker: Gianpiero Sportelli, AI Developer @ CELI – Language Technology

We have moved on-line, but at the end, we’ll still have time to discuss, gather proposals and suggestions for the next meetups.

Click here to sign up!

 

 

WASP & Optimizing massive HBase updates – On Line Meetup, 9th April 2020 | 7pm

Are you interested in WASP & Optimizing massive HBase updates?

If you want to discover more, don’t miss our next meetup, with two talk:
• “WASP: Wide Analytics Streaming Platform”, speaker: Antonio Murgia Big Data Engineer @ AgileLab.
• “Optimizing massive HBase updates with Apache Spark”, speaker: Luca Priscoglio e Andrea Venneri, Big Data Engineers @ Data Reply.

Due to the restrictive measures taken to deal with the covid-19 infection, we have decided for an online meetup.

See (and hear) you on 9th April 2020,  at 19:00, on YouTube.

At the end, time to discuss, socialize, gather proposals and suggestions for the next meetups.

Click here to sign up!

 

 

Using pySpark with Google Colab & Spark 3.0 preview – Meetup in Milan, 11th December 2019

Are you an Apache Spark user? Have you ever use pySpark leveraging the cloud service Google Colab?

If you want to discover more, don’t miss our next meetup, held by our evangelist Mario Cartia, who will introduce the architecture, the modules and the main functionalities of the framework showing some practical examples in Python, that do not require the installation of any software on your machine, using the Colab tool made available by Google.
The talk will also introduce the new features available on the preview version of the upcoming Apache Spark 3.0

At the end, free drinks for everyone and time to discuss, socialize, gather proposals and suggestions for the next meetups.

See you on 11th December 2019, at 18,30, in Milan, at YoRoom Coworking & Office, via Pastrengo, 14.

Click here to sign up!

Getting Started with Spark & Cassandra And using pySpark with Google Colab – Meet Up Milano – 19/06/2019

On Wednesday, June 19th, in Milan, another must-attend meetup is coming with two interesting talks:
Talk 1 – Getting Started with Spark & Cassandra – Bruno Guedes, Datastax – Technical Sales Engineer
How do we analyze big amount of data, whether it be massive batching or as it’s ingested via streaming ?
Enter Apache Spark. Get ready to rock with the most powerful distributed database and analytics platform on the planet!
Massively scalable, always on, and very fast. Apache Cassandra is the database chosen by Apple, Netflix, and 30 of the Fortune 100 to power their critical infrastructure.
Challenging MapReduce head on, Apache Spark offers powerful constructs that make it possible to slice and dice your data, as well as transformations with functional programming
backgrounds such as map, filter, and reduce.
Talk 2 – Getting started with pySpark in 5 minutes using Google Colab – Mario Cartia, Agile Lab – AI & Big Data Consultant/Evangelist/Trainer
Apache Spark is the Big Data opensource framework used by the world’s leading companies for the implementation of advanced analytics. The talk will introduce the architecture, the modules and the main functionalities of the framework showing some practical examples in Python that do not require the installation of any software on your machine using the Colab tool made available by Google.
And at the end, cocktail and FREE Drinks for everyone offered by Agile Lab!

Coworking Tortona – via Tortona, 33 – Milano

Apache Livy and Demystifying Typeclasses – Meetup Torino – 06/06/2019

In this Meet up we will have two talks, the first is the new Apache Livy and the second will concern Demystifying Typeclasses.
Speech 1 – The new Apache Livy – Marco Gaido (Big Data Engineer @ Agile Lab)
Apache Livy will take a big step forward in the next release: the main changes are support for Spark 2.4 and a new JDBC interface. But why do we need a JDBC interface in Livy? Isn’t Spark Thriftserver enough? In this speech we will answer these questions and immerse ourselves deeply in the way the new Thriftserver Livy was designed.
Speech 2 – Demystifying typeclasses – Andrea Fonti (Big Data Engineer @ Agile Lab)
In this talk we will discuss the concept of ad-hoc polymorphism and the “Type Class” pattern. We will start from a basic function able to sum two integers and construct an abstraction that will lead us to a polymorphic function able to aggregate an arbitrary number of values.
Join us!

Deep Learning On The Edge and Server Based: Intro, Differences and Use Cases – Meetup Milano – 18/06/2018

In this Agile Lab Meetup, the speaker lineup was as follows:

• Luca Ruzzola, Data Scientist at Agile Lab.
He will introduce us in the Deep Learning world, with a focusing on differences in hardware, software, and models between Deep Learning on the edge and server based.

• Vincenzo Manzoni, Data Science Director at Tenaris.
He will describe an server-based application, implemented by Tenaris, which exploits a very accurate image classification algorithm in the field of manufacturing.

• Daniele Cleri, Senior Software Solutions Architect at AAEON Technology.
His presentation will regard the implementation of a new interactive and Intelligent Retail System for the cosmetic industry, enhanced by the Intel Movidius Myriad 2 VPU.

For those who did not attend our meetup:

Discover our Meetup “Apache Spark & More Milano”