Data Science Certification Training
Edureka’s Data Science Training lets you gain expertise in Machine Learning algorithms such as K-Means Clustering, Decision Trees, Random Forest, and Naive Bayes using R. The training also covers a conceptual understanding of Statistics, Time Series, and Text Mining, plus an introduction to Deep Learning. Throughout this Data Science Course, you will implement real-life use cases in Media, Healthcare, Social Media, Aviation and HR.
Introduction to Data Science
Learning Objective: Get an introduction to Data Science in this module and see how Data Science helps to analyze large and unstructured data with different tools.
- What is Data Science?
- What does Data Science involve?
- Era of Data Science
- Business Intelligence vs Data Science
- Life cycle of Data Science
- Tools of Data Science
- Introduction to Big Data and Hadoop
- Introduction to R
- Introduction to Spark
- Introduction to Machine Learning
Statistical Inference
Learning Objective: In this module, you will learn about different statistical techniques and terminologies used in data analysis.
- What is Statistical Inference?
- Terminologies of Statistics
- Measures of Center
- Measures of Spread
- Normal Distribution
- Binomial Distribution
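As a rough illustration of the measures and distributions listed above, here is a minimal sketch in base R. The sample values are made up for illustration and are not course data; the discrete two-outcome distribution is assumed here to be the binomial.

```r
# Hypothetical sample; values chosen only for illustration
x <- c(2, 4, 4, 5, 7, 9, 11)

# Measures of center
mean_x <- mean(x)        # arithmetic mean
median_x <- median(x)    # middle value of the sorted sample

# Measures of spread
range_x <- max(x) - min(x)
var_x <- var(x)          # sample variance (n - 1 denominator)
sd_x <- sd(x)            # sample standard deviation

# Normal distribution: probability that a standard normal value is below 1.96
p_norm <- pnorm(1.96)

# Binomial distribution: probability of exactly 3 successes
# in 10 independent trials with success probability 0.5
p_binom <- dbinom(3, size = 10, prob = 0.5)
```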
Data Extraction, Wrangling and Exploration
Learning Objective: Discuss the different sources available to extract data, arrange the data in structured form, analyze the data, and represent the data in a graphical format.
- Data Analysis Pipeline
- What is Data Extraction?
- Types of Data
- Raw and Processed Data
- Data Wrangling
- Exploratory Data Analysis
- Visualization of Data
- Loading different types of datasets in R
- Arranging the data
- Plotting the graphs
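The load, arrange, explore, and plot steps above could look roughly like this in base R. This is only a sketch: the CSV content, column names, and figures are hypothetical, and the data is read from a string so the example is self-contained.

```r
# Hypothetical CSV content, read from a string instead of a file
csv_text <- "city,month,sales
Pune,Jan,120
Delhi,Jan,95
Pune,Feb,NA
Delhi,Feb,110"
raw <- read.csv(text = csv_text, stringsAsFactors = FALSE)

# Wrangling: drop rows with missing values, then sort by sales (descending)
clean <- raw[complete.cases(raw), ]
clean <- clean[order(-clean$sales), ]

# Exploration: summary statistics and total sales per city
sales_summary <- summary(clean$sales)
totals <- aggregate(sales ~ city, data = clean, FUN = sum)

# Visualization: bar chart of total sales per city.
# pdf(NULL) discards the drawing so the sketch also runs non-interactively;
# remove these two device lines in an interactive session to see the plot.
pdf(NULL)
barplot(totals$sales, names.arg = totals$city, ylab = "Total sales")
invisible(dev.off())
```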
Introduction to Machine Learning
Learning Objective: Get an introduction to Machine Learning as part of this module. You will discuss the various categories of Machine Learning and implement Supervised Learning Algorithms.
- What is Machine Learning?
- Machine Learning Use-Cases
- Machine Learning Process Flow
- Machine Learning Categories
- Supervised Learning algorithms: Linear Regression and Logistic Regression
- Implementing a Linear Regression model in R
- Implementing a Logistic Regression model in R
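For a feel of the two hands-on items above, the following sketch fits both models with base R on the built-in `mtcars` dataset. The choice of dataset and variables is an assumption made for illustration, not the course's own exercise.

```r
# mtcars ships with base R (datasets package)
data(mtcars)

# Linear regression: fuel efficiency (mpg) as a function of weight (wt)
lin_fit <- lm(mpg ~ wt, data = mtcars)
lin_coef <- coef(lin_fit)   # intercept and slope

# Logistic regression: transmission type (am: 0 = automatic, 1 = manual)
# as a function of weight
log_fit <- glm(am ~ wt, data = mtcars, family = binomial)

# Predicted probability of a manual transmission for a hypothetical car
# weighing 3 (in units of 1000 lbs, as in mtcars)
p_manual <- predict(log_fit, newdata = data.frame(wt = 3), type = "response")
```

`summary(lin_fit)` and `summary(log_fit)` print coefficient tables and fit statistics for each model.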
Classification Techniques
Learning Objective: In this module, you should learn Supervised Learning techniques for classification and how to implement them, including Decision Trees, the Random Forest Classifier, Naive Bayes, and Support Vector Machines.
- What is Classification and what are its use cases?
- What is a Decision Tree?
- Algorithm for Decision Tree Induction
- Creating a Perfect Decision Tree
- Confusion Matrix
- What is Random Forest?
- What is Naive Bayes?
- Support Vector Machine: Classification
- Implementing a Decision Tree model in R
- Implementing Random Forest in R
- Implementing a Naive Bayes model in R
- Implementing a Support Vector Machine in R
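A decision tree and its confusion matrix, two of the topics above, can be sketched as follows. The example assumes the `rpart` package is installed (it ships with standard R distributions) and uses the built-in `iris` dataset; the 100/50 split and the seed are arbitrary choices for illustration.

```r
library(rpart)  # assumed to be installed; ships with standard R distributions

set.seed(42)                            # reproducible split
train_idx <- sample(nrow(iris), 100)    # 100 rows for training, 50 held out
train <- iris[train_idx, ]
test <- iris[-train_idx, ]

# Decision tree predicting species from the four flower measurements
tree_fit <- rpart(Species ~ ., data = train, method = "class")

# Confusion matrix: rows are actual classes, columns are predicted classes
pred <- predict(tree_fit, newdata = test, type = "class")
conf_mat <- table(actual = test$Species, predicted = pred)

# Accuracy: share of held-out rows that fall on the diagonal
accuracy <- sum(diag(conf_mat)) / sum(conf_mat)
```

Random Forest, Naive Bayes, and Support Vector Machine models follow the same fit, predict, and tabulate pattern, but typically rely on add-on packages that are not used in this sketch.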
Unsupervised Learning
Learning Objective: Learn about Unsupervised Learning and the various types of clustering that can be used to analyze the data.
- What is Clustering and what are its use cases?
- What is K-means Clustering?
- What is C-means Clustering?
- What is Canopy Clustering?
- What is Hierarchical Clustering?
- Implementing K-means Clustering in R
- Implementing C-means Clustering in R
- Implementing Hierarchical Clustering in R
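K-means and hierarchical clustering are both available in base R, so the hands-on items above can be sketched without extra packages. The dataset (`iris`), the number of clusters, and the linkage method are assumptions chosen for illustration; C-means is left out because it needs an add-on package.

```r
# Use the four numeric columns of iris, standardized so that
# no single measurement dominates the distance calculation
features <- scale(iris[, 1:4])

# K-means: partition the rows into 3 clusters;
# nstart = 20 tries 20 random starts and keeps the best result
set.seed(1)
km <- kmeans(features, centers = 3, nstart = 20)

# Hierarchical clustering: build a tree from pairwise Euclidean distances,
# then cut it into 3 groups
hc <- hclust(dist(features), method = "complete")
hc_clusters <- cutree(hc, k = 3)

# Compare cluster assignments with the known species labels
km_vs_species <- table(cluster = km$cluster, species = iris$Species)
```

`plot(hc)` draws the dendrogram, which is a common way to decide where to cut the tree.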
Recommender Engines
Learning Objective: In this module, you should learn about association rules and different types of Recommender Engines.
- What are Association Rules and what are their use cases?
- What is a Recommendation Engine and how does it work?
- Types of Recommendations
- User-Based Recommendation
- Item-Based Recommendation
- Difference: User-Based and Item-Based Recommendation
- Recommendation use cases
- Implementing Association Rules in R
- Building a Recommendation Engine in R
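The difference between user-based and item-based recommendation comes down to which axis of the ratings matrix is compared. The sketch below shows this with cosine similarity in base R. The ratings matrix is hypothetical, and treating 0 as "not rated" inside the similarity is a simplification made for illustration.

```r
# Hypothetical ratings matrix: rows are users, columns are items, 0 = not rated
ratings <- matrix(c(5, 4, 0, 1,
                    4, 5, 1, 0,
                    1, 0, 5, 4,
                    0, 1, 4, 5),
                  nrow = 4, byrow = TRUE,
                  dimnames = list(paste0("user", 1:4), paste0("item", 1:4)))

# Scale every row to unit length so that a dot product equals cosine similarity
normalize_rows <- function(m) m / sqrt(rowSums(m^2))

# User-based: compare users (rows) by the ratings they gave
user_sim <- tcrossprod(normalize_rows(ratings))

# Item-based: compare items (columns) by the ratings they received
item_sim <- tcrossprod(normalize_rows(t(ratings)))

# The user most similar to user1, excluding user1 itself
others <- colnames(user_sim) != "user1"
nearest <- names(which.max(user_sim["user1", others]))
```

A user-based engine would then suggest items that `nearest` rated highly and `user1` has not rated; an item-based engine would instead look up the items most similar to those `user1` already rated highly.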
Text Mining
Learning Objective: In this module, discuss Unsupervised Machine Learning techniques for text and the implementation of algorithms such as TF-IDF and Cosine Similarity.
- The concepts of text-mining
- Use cases
- Text Mining Algorithms
- Quantifying text
- Beyond TF-IDF
- Implementing Bag of Words approach in R
- Implementing Sentiment Analysis on Twitter Data using R
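To make "quantifying text" concrete, here is a minimal base R sketch of the Bag of Words approach, followed by TF-IDF weighting and cosine similarity between documents. The three documents are invented for illustration, and the TF-IDF variant used (term frequency times the natural log of inverse document frequency) is one common choice among several.

```r
# Hypothetical mini-corpus
docs <- c(d1 = "data science is fun",
          d2 = "data mining is useful",
          d3 = "science needs data")

# Tokenize on whitespace and collect the vocabulary
tokens <- strsplit(tolower(docs), "\\s+")
vocab <- sort(unique(unlist(tokens)))

# Bag of Words: term counts, one row per document, one column per term
bow <- t(sapply(tokens, function(tk) table(factor(tk, levels = vocab))))

# TF-IDF: term frequency within a document, weighted down for
# terms that occur in many documents
tf <- bow / rowSums(bow)
idf <- log(nrow(bow) / colSums(bow > 0))
tfidf <- sweep(tf, 2, idf, `*`)

# Cosine similarity between documents in TF-IDF space
unit <- tfidf / sqrt(rowSums(tfidf^2))
doc_sim <- tcrossprod(unit)
```

Because "data" occurs in every document, its IDF is zero and it contributes nothing to the similarity; this is the sense in which TF-IDF discounts uninformative terms.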
Time Series
Learning Objective: In this module, you should learn about Time Series data, the different components of Time Series data, and Time Series modeling with Exponential Smoothing models and the ARIMA model for Time Series Forecasting.
- What is Time Series data?
- Time Series variables
- Different components of Time Series data
- Visualize the data to identify Time Series Components
- Implement ARIMA model for forecasting
- Exponential smoothing models
- Identifying the time series scenarios in which each Exponential Smoothing model can be applied
- Implement the respective ETS model for forecasting
- Visualizing and formatting Time Series data
- Plotting decomposed Time Series data
- Applying ARIMA and ETS model for Time Series Forecasting
- Forecasting for a given time period
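The decompose, smooth, and forecast workflow above can be sketched with base R alone. This example uses the built-in `AirPassengers` series and substitutes `HoltWinters` from the stats package for a dedicated ETS implementation; the ARIMA orders are the classic "airline model" settings, assumed here for illustration rather than selected by any procedure from the course.

```r
# Monthly airline passenger counts, 1949 to 1960 (ships with base R)
data(AirPassengers)

# Components: split the series into trend, seasonal, and random parts
parts <- decompose(AirPassengers, type = "multiplicative")

# Exponential smoothing with trend and multiplicative seasonality
hw_fit <- HoltWinters(AirPassengers, seasonal = "multiplicative")
hw_fc <- predict(hw_fit, n.ahead = 12)    # forecast the next 12 months

# Seasonal ARIMA on the log scale, then transform forecasts back
arima_fit <- arima(log(AirPassengers),
                   order = c(0, 1, 1),
                   seasonal = list(order = c(0, 1, 1), period = 12))
arima_fc <- exp(predict(arima_fit, n.ahead = 12)$pred)
```

`plot(parts)` shows the decomposed components, and `plot(hw_fit, hw_fc)` overlays the smoothing forecast on the observed series.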
Deep Learning
Learning Objective: Get introduced to the concepts of Reinforcement Learning and Deep Learning in this module. These concepts are explained with the help of use cases. You will discuss Artificial Neural Networks, their building blocks, and a few Artificial Neural Network terminologies.
- Reinforcement Learning
- Reinforcement learning Process Flow
- Reinforcement Learning Use cases
- Deep Learning
- Biological Neural Networks
- Understand Artificial Neural Networks
- Building an Artificial Neural Network
- How ANN works
- Important Terminologies of ANNs
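To illustrate how the simplest building block of an Artificial Neural Network works, the sketch below trains a single artificial neuron (a perceptron) on the logical AND function in base R. The task, the step activation, the learning rate, and the number of epochs are all assumptions chosen to keep the example small; real networks stack many such units and use smoother activations.

```r
# Inputs and targets for logical AND
X <- matrix(c(0, 0,
              0, 1,
              1, 0,
              1, 1), ncol = 2, byrow = TRUE)
y <- c(0, 0, 0, 1)

# Activation function: the neuron fires (1) when its weighted input is positive
step <- function(z) as.numeric(z > 0)

# Terminology: weights (w), bias (b), learning rate (lr), epoch (one full pass)
w <- c(0, 0)
b <- 0
lr <- 1

for (epoch in 1:20) {
  for (i in seq_len(nrow(X))) {
    pred <- step(sum(w * X[i, ]) + b)   # forward pass for one example
    err <- y[i] - pred                  # error signal
    w <- w + lr * err * X[i, ]          # nudge weights toward the target
    b <- b + lr * err                   # nudge bias toward the target
  }
}

# Predictions of the trained neuron for all four inputs
preds <- as.vector(step(X %*% w + b))
```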