2018/9/10(Mon), 9/11(Tue), 9/12(Wed), 9/13(Thu), 9/14(Fri)
9AM PT (Pacific Time Zone)
10 hrs of online classes (2 hrs a day) plus exercise homework.
Student discount (50% off) available. Click here.
It will include 6 month subscription of Exploratory Desktop - Personal Edition and the training materials. This will give you enough time and environment to keep improving your Data Science skills by applying what you have learned in the training to your real world data.
We will issue a certificate once you have completed our Data Science Booster training.
Individual payments for Data Science Booster Training must be made by credit card through our website. All prices are in US Dollars.
If you are not happy with your training experience for any reason, we will refund you with the full amount.
Who will be right for this training?
This training would be perfect if you :
- Want to learn how to apply Data Science in the real world business scenario.
- Want to start a journey of becoming a Data Scientist.
- Want to start Data Science projects but don’t know where to start.
- Wanted learn Data Science before, but gave up due to the steep learning curve of learning programming and/or statistics.
What you need before the training
- Being able to use Excel (or any other spreadsheet tools) and perform the basic calculations (e.g. sum, average, etc.).
- Being curious about Data.
- Having a desire and commitment to learn how to understand data better.
- Positive attitude towards to learning something new.
What you don’t need before the training
- Programming skill (if you have, of course you can do a lot more down the road, but not necessary for this training.)
- Statistics background (if you have, of course that makes things easier, but not necessary for this training.)
- Negative attitude towards to learning something new. ;)
“The Data Science Booster training is a very well designed course that guides you not only manipulating structured or unstructured data using the Exploratory GUI tool but also speeding your productivity in term of extracting insights from your data and enhancing your analysis. ”
Alan Ponce, Student at University of Southampton
“Had the pleasure of taking Exploratory’s Data Science Booster Training and enjoyed every minute! Kan demonstrated dozens of techniques in Exploratory’s brilliant UI and went into details precisely as much as needed (no more, no less). Lessons were broken into bite-sized pieces (survival analysis, forecasting, regression, unsupervised learning, data wrangling, much much more...), every day of training was engaging and well-designed.”
Brandon Weinberg, Data Analyst
Training class runs from 9AM - 11AM PT (US Pacific Timezone) each day from Septempber 10th - 14th, 2018.
Before the training starts, we ask you to install Exploratory Desktop onto your Mac (OSX 10.11 or later) or Windows (Windows7 or later), and complete Exploratory’s ‘Getting Started Tutorial’. This will allow us to spend more time on learning Data Science itself rather than learning how to use Exploratory.
- Install Exploratory (appx. 30 minutes, mostly the time for downloading necessary softwares.)
- Getting Started with Exploratory - Tutorial (appx. 60 - 90 minutes.)
We will send you a setup guide and a getting started tutorial, and support you via online meetings or chats if you encounter any issues or questions.
Day 1 - Getting into Data Science
9AM - 10AM PT
Welcome to Exploratory’s Data Science Training
- Introduction to Data Science, R, and Exploratory
You will learn the basics of Data Science, R, and Exploratory.
10AM - 11AM PT
Introduction to Exploratory Data Analysis and Data Visualization Hands-on
You will learn how to understand the overview of the data by looking at the numeric summaries, distribution of the data, and relationships between columns.
Day 2 - Gaining Insights by Data Wrangling
Unfortunately, the real world data never comes in clean formats that are ready for analysis. The good news is, by learning the grammar of Data Wrangling you can address various challenges in cleaning and preparing data for analysis.
9AM - 10AM PT
Introduction to Exploratory Data Wrangling - Part 1.
You will learn the basics of Data Wrangling grammar following a dplyr framework - a grammar of Data Wrangling - to explore the data effectively.
10AM - 11AM PT
Introduction to Exploratory Data Wrangling - Part 2. : Working with Text and Date Data
You will learn how to address the most common challenges when working with Text and Date data by cleaning and transforming them.
Day 3 - Gaining Insights by Statistical Algorithms
9AM - 10AM PT
Correlation Analysis - Introduction to Correlation Algorithm
You will learn what Correlation algorithm is and how you can use it to find highly correlated pairs of the variables.
Hands-on: Finding correlation among the web traffic measures and pick the measures that you want to pay attentions to.
10AM - 11AM PT
Similarity and Clustering Analysis - Introduction to Distance, Multi- Dimensional Scaling, and K-means Clustering algorithms.
You will learn Distance, MDS, and K-means clustering algorithms and how to use them to understand the relationships among the categories quickly.
Hands-on: Finding similarities among customers based on their past product purchase history and creating customer segmentation for a better targeted marketing campaign.
Day 4 - Gaining Insights by Machine Learning Algorithms - Part 1
9AM - 10AM PT
Regression Analysis - Introduction to Regression Algorithm
You will learn how to understand relationships among variables and the effect of each variable to one another by using the most commonly used algorithms - Linear and Logistic Regressions.
Hands-on: Predict customer purchase amount based on some of the attributes such as, the time spent on the web site, locations, date and time, past purchase history, etc.
10AM - 11AM PT
Variable Importance Analysis - Introduction to Classification and Random Forest Algorithm
You will learn how to use one of the most popular Machine Learning algorithms among data scientists - Random Forest - to understand what are the key factors that you want to pay attentions to in order to get your desired outcomes.
Hands-on: Discover what product features and customer attributes help customers convert to paid customers.
Day 5 - Gaining Insights by Machine Learning Algorithms - Part 2
9AM - 10AM PT
Time Series Forecasting - Introduction to Prophet Algorithm
You will learn how to forecast for time series data by building forecasting models.
Hands-on: Predict the web site traffic over the next 3 months based on the historical data so that you can prepare your team by allocating required resources better.
10AM - 11AM PT
Cohort Analysis - Introduction to Survival Analysis Algorithm
You will learn how to estimate survival rates by different cohorts (groups) by applying Survival Analysis algorithm.
Hands-on: Discover which cohorts are showing better (or worse) customer retention rates and what makes difference between the converted users and the not-converted.
- Office Hours for the Training Participants to Drop in
- One-on-One Meeting for Consultation / Mentoring
- Chat at Slack and at Exploratory’s website for questions and answers
- Monthly Update with the latest news in Data Science to keep your knowledge/skill up to date
After the training, we will schedule an Office Hours (3 hours) during which we make ourselves available for answering at Slack channels and at interactive web conference sessions. You can also book one hour of one-on-one meeting with us to discuss your questions and challenges with your data. We’d encourage you to take advantage of these opportunities to maximize the values of the training.
This training is designed to boost your Data Science skills so that you can start using Data Science methods right after the class. But the field of Data Science is huge and it is evolving rapidly every day. Becoming better at Data Science is a journey of continuously learning new techniques and algorithms, just like learning any other professions. We are here to help you making the journey as satisfying, valuable, and most importantly FUN!
The training is delivered entirely online over a period of 5 days. It consists of a series of one hour length modules. You will finish two modules a day throughout the 5 days by learning and exercising various Data Science methods ranging from Data Wrangling, Data Visualization, Statistics, and Machine Learning.
On-line Live Chat is available throughout the training hours. Training group channel is also available at Slack for discussions and questions even outside of the classes.
We recommend you to reserve a minimum commitment of 3 hours a day for joining the live on-line sessions and completing the exercise section for each module.
Every live session will be recorded and available for the participants to review it later. The participants can keep all the training materials for reviewing them later. However, it is prohibited for commercial uses or re-distribution.
Kan Nishida (CEO, Exploratory) Twitter
Led development teams to build various Data Science products including Machine Learning, BI, Data Visualization, Mobile Analytics, Database, etc. as a development director at Oracle, while building a team to provide training and consulting services to equip teams at Fortune 500 companies with Data Science skills. Beginning of 2016, started Exploratory to make the rapid pace of the advancements in Data Science accessible to 99% of people around the world. As a CEO and Chief Product/Education Officer, he’s spending most of his time making Data Science more accessible and easier by building tools and teaching.
Who Are We?
We had worked at some of the biggest enterprise software companies in Silicon Valley for building Data Science products and solutions for about twenty years before starting Exploratory, but never seen anything of this scale and speed of innovation that is currently undertaking in the world of Data Science in the last decades. Fortunately, much of the advancements are happening in Open Source communities so they are available to anybody. Unfortunately, much of such advancements have been locked in the form of computer languages (e.g. R, Python, etc.). So practically speaking, only the people with programming skills have been benefiting from such advancements, but people without the skills have not. And this has created a massive gap between them in terms of the qualities of decision making based on data.
We wanted to change that by building a simple and cohesive UI experience for Data Science so that more people can access such advancements and gain deeper insights from their everyday data to maximize their business and career opportunities. And that is Exploratory,
a tool that connects the modern Data Science and people without programming skills. Since we started in March 2016, we have now over 12,000 registered users in 86 countries all around the world who are making better decisions by understanding their data effectively.