# PROFESSIONAL COURSE IN DATA SCIENCE

## Program Description

More and more businesses today are using Data Science to add value to every aspect of their operations. This has led to a substantial increase in the demand for Data Scientists who are skilled in technology, mathematics and business. However, supply has not kept pace with demand, creating many well-paid job opportunities for Data Scientists. This extensive 6-month training in Data Science gives you broad exposure to key concepts and tools, from Python and R to Machine Learning and much more. After 1400+ hours of training, you will be well prepared to take on Data Science challenges.

As a process-oriented organization that provides data science training, we evaluate trainees for certification on the basis of their performance in the following criteria:

- Academic performance
- Assignment scores
- Attitude, punctuality and dedication
- Live project performance reports by client and academic group

## Module 1: Programming Basics

Industry experts acknowledge that anybody who is comfortable with the basics of programming, such as loops, functions, if-else statements, and programming logic, can become a successful data scientist. Being a good programmer is a highly valued skill for a data scientist, and that is where this module will help you.

A data scientist spends the majority of their time either cleaning raw data to make it usable or implementing appropriate ML algorithms to extract underlying relationships that help in business decision-making. All of this is carried out using Python or R. Because Python is a vital skill for a Data Scientist, knowing it goes a long way toward flattening the learning curve for a Data Science enthusiast. This section will cover basic concepts in Python and give you an opportunity to polish your programming skills.
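As a small illustration of how these basics combine in a data-cleaning task, the sketch below uses a function, a loop, and if-else logic to tidy a list of raw values. The function name `clean_ages`, the sample data, and the plausibility range are all hypothetical and chosen only for this example.

```python
def clean_ages(raw_values):
    """Return valid ages as integers, skipping blank or malformed entries."""
    cleaned = []
    for value in raw_values:              # loop over every raw entry
        text = str(value).strip()         # normalise to a trimmed string
        if not text.isdecimal():          # reject blanks, text and signed numbers
            continue
        age = int(text)
        if 0 < age < 120:                 # assumed plausibility range for an age
            cleaned.append(age)
    return cleaned


raw = ["25", " 31 ", "", "n/a", "47", "-3", "200", 19]
print(clean_ages(raw))  # → [25, 31, 47, 19]
```

Real projects usually do this kind of cleaning with a library such as pandas, but the underlying logic is the same loop-and-condition pattern.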

## Module 2: Mathematics Basics

Mathematics is the backbone of the Data Science domain. Whether you are implementing a simple univariate Linear Regression model or applying statistical concepts to explore data, understanding the underlying mathematical concepts is essential for a successful Data Science career. This is why a Data Scientist must have a strong mathematical foundation. This section will cover the mathematical concepts required for this course.

Probability and Statistics form the basis of Data Science. Estimates and predictions are an important part of Data Science, and probability theory is very helpful for making them. Similarly, Statistics is an integral part of Exploratory Data Analysis. This section will strengthen your understanding of Statistics and Probability.
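To give a flavour of the kind of summary used in Exploratory Data Analysis, here is a minimal sketch using Python's built-in `statistics` module. The `daily_sales` figures are made up for illustration, and the "more than 15 sales" threshold is an arbitrary assumption.

```python
import statistics

# Made-up sample: units sold on seven consecutive days
daily_sales = [12, 15, 11, 19, 15, 22, 15]

mean = statistics.mean(daily_sales)      # average value
median = statistics.median(daily_sales)  # middle value when sorted
mode = statistics.mode(daily_sales)      # most frequent value
stdev = statistics.stdev(daily_sales)    # sample standard deviation (spread)

# Empirical probability: the share of days with more than 15 sales
p_high = sum(1 for s in daily_sales if s > 15) / len(daily_sales)

print(f"mean={mean:.2f} median={median} mode={mode} stdev={stdev:.2f}")
print(f"P(sales > 15) is roughly {p_high:.2f}")
```

The descriptive statistics summarise what the data looks like, while the empirical probability is a simple estimate that could feed into a prediction.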

## Module 3: Machine Learning Basics

With the help of Machine Learning, a Data Scientist is able to analyze huge amounts of data in real time. It helps in understanding the underlying trends or relationships present in the data, and for this reason Machine Learning has become an integral part of Data Science. This section will delve into the basics of Machine Learning and its types.

When working with huge amounts of data, a Data Scientist uses Machine Learning to implement statistical models that make predictions based on the underlying relationships present in the data. However, there are different types of Machine Learning, and knowing them will give you the understanding needed to choose the right one for a job. This section will cover the basics of Machine Learning and its various types.

## Module 4: Supervised Machine Learning

Supervised Machine Learning is used when we have to learn the relationship that maps an input to an output. It applies only when we have ample labeled data, that is, examples for which the output is known for a given set of input values. The goal is to approximate the mapping function so well that when we have new input data x, we can predict the output variable y for that data. It is called supervised learning because the process of an algorithm learning from the data can be thought of as a teacher supervising the learning process.
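The idea of learning a mapping from known input-output pairs can be sketched with one of the simplest supervised methods, a nearest-neighbour classifier. This technique is chosen here purely for illustration and is not necessarily the one taught in the module. The training data, the labels, and the `predict` function are hypothetical.

```python
import math

# Labelled examples: (hours_studied, hours_slept) paired with a known outcome.
# The values are made up for illustration.
training_inputs = [(1.0, 4.0), (2.0, 5.0), (8.0, 7.0), (9.0, 8.0)]
training_labels = ["fail", "fail", "pass", "pass"]


def predict(x):
    """Return the label of the training example closest to x."""
    distances = [math.dist(x, known) for known in training_inputs]
    nearest = distances.index(min(distances))  # position of the closest example
    return training_labels[nearest]


print(predict((7.5, 6.0)))  # → pass
print(predict((1.5, 4.5)))  # → fail
```

The known labels play the role of the "teacher": the prediction for a new input is only as good as the labelled examples it is compared against.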

Linear Regression is useful for finding the relationship between one or more independent variables and a dependent variable. It is used when the dependent variable is continuous. We try to find a "best-fit" line that represents the relationship between the dependent and independent variables involved. The aim is to find a linear relationship whose total prediction error across all data points is as small as possible. This section will cover the mathematical concepts and implementation details for a univariate Linear Regression model.
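A best-fit line for a single independent variable can be sketched as follows. This example assumes the common least-squares definition of "prediction error" (the sum of squared differences between actual and predicted values), which the text above does not spell out. The function name `fit_line` and the data points are hypothetical.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares best-fit line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # slope = (co-variation of x and y) / (variation of x)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    # The best-fit line always passes through the point (mean_x, mean_y)
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up data that roughly follows y = 2x + 1
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [3.1, 4.9, 7.2, 8.8, 11.0]

slope, intercept = fit_line(xs, ys)
predicted = slope * 6.0 + intercept  # prediction for a new input x = 6
print(f"y = {slope:.2f}x + {intercept:.2f}, prediction at x=6: {predicted:.2f}")
```

For this data the fitted slope is about 1.97 and the intercept about 1.09, close to the line the points were generated around.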

## Module 5: Unsupervised Machine Learning

In the real world, a Data Scientist often faces a situation where only the predictor or input variables are known, with no corresponding data for the output variable. In such scenarios, Unsupervised Machine Learning comes to our aid. The goal of unsupervised learning is to model the underlying structure or distribution of the data in order to learn more about it. It is called unsupervised learning because, unlike supervised learning, there are no correct answers and there is no teacher.

Unsupervised Machine Learning algorithms try to find an underlying trend or pattern based on the values of the independent variables, without any knowledge of the actual values of the dependent variable. Although unsupervised machine learning methods cannot be directly applied to a regression or classification problem, they are useful in clustering, anomaly detection, association mining, etc. This section will help you implement unsupervised machine learning algorithms.
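Clustering, one of the uses mentioned above, can be sketched with a bare-bones k-means loop on one-dimensional data. K-means is used here only as a familiar example of the idea. The function `k_means_1d`, the starting centroids, and the `spend` values are hypothetical.

```python
def k_means_1d(points, centroids, iterations=10):
    """Group 1-D points around centroids; no labels are used."""
    centroids = list(centroids)
    clusters = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        for i, cluster in enumerate(clusters):
            if cluster:  # an empty cluster leaves its centroid unchanged
                centroids[i] = sum(cluster) / len(cluster)
    return centroids, clusters


# Made-up customer spend values with two visible groups
spend = [10, 12, 11, 50, 52, 49]
centroids, clusters = k_means_1d(spend, centroids=[10, 50])
print(clusters)  # → [[10, 12, 11], [50, 52, 49]]
```

No output variable is supplied at any point: the two groups emerge purely from how close the input values are to each other.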

## Module 6: Deep Learning

Deep Learning is a subfield of machine learning comprising algorithms that try to mimic the structure and function of the human brain. Just as our brain learns from experience, a deep learning algorithm performs a task repeatedly, improving the outcome each time by learning from experience. The term 'deep learning' refers to neural networks with many layers, which enable complex learning behavior. With huge volumes of data being generated daily, the relevance of deep learning algorithms has soared recently. The availability of strong computing power has also contributed to the increased usage of such algorithms. In this section you will cover the necessary concepts of Deep Learning.
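To make the idea of stacked layers concrete, the sketch below passes an input through a tiny network with two hidden layers. It shows only the forward computation. The weights are hand-picked rather than learned, so the training step ("learning from experience") is deliberately left out. All names and numbers are hypothetical.

```python
def relu(values):
    """Activation function: negative values become zero."""
    return [max(0.0, v) for v in values]


def dense(inputs, weights, biases):
    """One fully connected layer: each output is a weighted sum plus a bias."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def forward(inputs, layers):
    """Feed the inputs through every layer in order."""
    activations = inputs
    for i, (weights, biases) in enumerate(layers):
        activations = dense(activations, weights, biases)
        if i < len(layers) - 1:  # apply ReLU on hidden layers only
            activations = relu(activations)
    return activations


# Hand-picked (untrained) weights and biases, for illustration only
layers = [
    ([[1.0, -1.0], [0.5, 0.5]], [0.0, 0.0]),  # hidden layer 1: 2 inputs, 2 units
    ([[1.0, 1.0], [-1.0, 1.0]], [0.0, 1.0]),  # hidden layer 2: 2 units
    ([[1.0, 2.0]], [0.5]),                    # output layer: 1 unit
]

output = forward([2.0, 1.0], layers)
print(output)  # → [6.0]
```

In practice the weights are adjusted automatically during training, and frameworks handle networks with far more layers and units than this.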

## Module 7: Machine Learning Project Implementation

This module runs for 3 months and consists of hands-on training and development of live Machine Learning projects as per the requirements of our client. It is a unique learning experience in which you learn about implementation and see how professionals work in a development environment. This training will help you prepare for interviews in the field of Data Science and move toward your dream job.