workshop Machine learning with spark
Workshop date & duration: November 5th – Tuesday, 14:00 – 18:00
Trainer: Sorin Peste
Supporting students: Alexandru Petrescu, Laurentiu Partas
Location: TechHub Bucuresti
Price: Free (upon approval by the organizer & trainer)
Number of places: 20 no more places left
Languages: Python
Description:
We are coming back in November with a new workshop on Machine Learning – this time with how to build a model using Spark ML decision trees and gradient boosting.
So, come join us for an afternoon in which we will explore Apache Spark’s Machine Learning capabilities. We’ll be looking at using Spark to build a Credit Scoring model which estimates the probability of default for current and existing customers.
Agenda:
1. Intro, problem description and setup
2. Loading data
3. Exploratory data analysis
4. Feature engineering
5. Building our first model
6. Testing and validation
7. Improving the model with cross-validation and hyper-parameter tuning
8. Deployment
9. Considerations for running in production
Prerequisites
Bring your laptop! Beyond that, the only requirement is a web browser.
We will be using a managed Spark platform, called Azure Databricks, for the lab.
We will provide access to Azure Databricks and everything else you need.
If you would like to attend this workshop please complete the form below and read the note regarding the confirmation for participation in this event.