Data Mining and Analysis Training Course
Objective:
Delegates be able to analyse big data sets, extract patterns, choose the right variable impacting the results so that a new model is forecasted with predictive results.
Course Outline
-
Data preprocessing
- Data Cleaning
- Data integration and transformation
- Data reduction
- Discretization and concept hierarchy generation
-
Statistical inference
- Probability distributions, Random variables, Central limit theorem
- Sampling
- Confidence intervals
- Statistical Inference
- Hypothesis testing
-
Multivariate linear regression
- Specification
- Subset selection
- Estimation
- Validation
- Prediction
-
Classification methods
- Logistic regression
- Linear discriminant analysis
- K-nearest neighbours
- Naive Bayes
- Comparison of Classification methods
-
Neural Networks
- Fitting neural networks
- Training neural networks issues
-
Decision trees
- Regression trees
- Classification trees
- Trees Versus Linear Models
-
Bagging, Random Forests, Boosting
- Bagging
- Random Forests
- Boosting
-
Support Vector Machines and Flexible disct
- Maximal Margin classifier
- Support vector classifiers
- Support vector machines
- 2 and more classes SVM’s
- Relationship to logistic regression
-
Principal Components Analysis
-
Clustering
- K-means clustering
- K-medoids clustering
- Hierarchical clustering
- Density based clustering
-
Model Assesment and Selection
- Bias, Variance and Model complexity
- In-sample prediction error
- The Bayesian approach
- Cross-validation
- Bootstrap methods
Open Training Courses require 5+ participants.
Data Mining and Analysis Training Course - Booking
Data Mining and Analysis Training Course - Enquiry
Data Mining and Analysis - Consultancy Enquiry
Consultancy Enquiry
Testimonials (5)
I was benefit from the guidance and sharing life examples + answering all questions.
Marta Melloch - Amazon Development Center Poland Sp. z o.o.
Course - Data Mining and Analysis
I really enjoyed the all the best.
Halil polat - Amazon Development Center Poland Sp. z o.o.
Course - Data Mining and Analysis
The information given was interesting and the best part was towards the end when we were provided with Data from Durex and worked on Data we are familiar with and perform operations to get results.
Jessica Chaar
Course - Data Mining and Analysis
The hands-on exercise and the trainer capacity to explain complex topics in simple terms.
youssef chamoun
Course - Data Mining and Analysis
I like the exercises done.
Nour Assaf
Course - Data Mining and Analysis
Upcoming Courses (Minimal 5 peserta)
Related Courses
Programming with Big Data in R
21 HoursBig Data is a term that refers to solutions destined for storing and processing large data sets. Developed by Google initially, these Big Data solutions have evolved and inspired other similar projects, many of which are available as open-source. R is a popular programming language in the financial industry.
R Fundamentals
21 HoursR is an open-source free programming language for statistical computing, data analysis, and graphics. R is used by a growing number of managers and data analysts inside corporations and academia. R has also found followers among statisticians, engineers and scientists without computer programming skills who find it easy to use. Its popularity is due to the increasing use of data mining for various goals such as set ad prices, find new drugs more quickly or fine-tune financial models. R has a wide variety of packages for data mining.
Data Mining with R
14 HoursR is an open-source free programming language for statistical computing, data analysis, and graphics. R is used by a growing number of managers and data analysts inside corporations and academia. R has a wide variety of packages for data mining.
Econometrics: Eviews and Risk Simulator
21 HoursThis instructor-led, live training in Indonesia (online or onsite) is aimed at anyone who wishes to learn and master the fundamentals of econometric analysis and modeling.
By the end of this training, participants will be able to:
- Learn and understand the fundamentals of econometrics.
- Utilize Eviews and risk simulators.
Forecasting with R
14 HoursThis instructor-led, live training in Indonesia (online or onsite) is aimed at intermediate-level data analysts and business professionals who wish to perform time series forecasting and automate data analysis workflows using R.
By the end of this training, participants will be able to:
- Understand the fundamentals of forecasting techniques in R.
- Apply exponential smoothing and ARIMA models for time series analysis.
- Utilize the ‘forecast’ package to generate accurate forecasting models.
- Automate forecasting workflows for business and research applications.
HR Analytics for Public Organisations
14 HoursThis instructor-led, live training (online or onsite) is aimed at HR professionals who wish to use analytical methods improve organisational performance. This course covers qualitative as well as quantitative, empirical and statistical approaches.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Marketing Analytics using R
21 HoursAudience
Business owners (marketing managers, product managers, customer base managers) and their teams; customer insights professionals.
Overview
The course follows the customer life cycle from acquiring new customers, managing the existing customers for profitability, retaining good customers, and finally understanding which customers are leaving us and why. We will be working with real (if anonymous) data from a variety of industries including telecommunications, insurance, media, and high tech.
Format
Instructor-led training over the course of five half-day sessions with in-class exercises as well as homework. It can be delivered as a classroom or distance (online) course.
R for Data Analysis and Research
7 HoursAudience
- managers
- developers
- scientists
- students
Format of the course
on-line instruction and discussion OR face-to-face workshops
Introduction to R
21 HoursR is an open-source free programming language for statistical computing, data analysis, and graphics. R is used by a growing number of managers and data analysts inside corporations and academia. R has also found followers among statisticians, engineers and scientists without computer programming skills who find it easy to use. Its popularity is due to the increasing use of data mining for various goals such as set ad prices, find new drugs more quickly or fine-tune financial models. R has a wide variety of packages for data mining.
This course covers the manipulation of objects in R including reading data, accessing R packages, writing R functions, and making informative graphs. It includes analyzing data using common statistical models. The course teaches how to use the R software (https://www.r-project.org) both on a command line and in a graphical user interface (GUI).
R
21 HoursR is an open-source free programming language for statistical computing, data analysis, and graphics. R is used by a growing number of managers and data analysts inside corporations and academia. R has also found followers among statisticians, engineers and scientists without computer programming skills who find it easy to use. Its popularity is due to the increasing use of data mining for various goals such as set ad prices, find new drugs more quickly or fine-tune financial models. R has a wide variety of packages for data mining.
Neural Network in R
14 HoursThis course is an introduction to applying neural networks in real world problems using R-project software.
Advanced R Programming
7 HoursThis course is for data scientists and statisticians that already have basic R & C++ coding skills and R code and need advanced R coding skills.
The purpose is to give a practical advanced R programming course to participants interested in applying the methods at work.
Sector specific examples are used to make the training relevant to the audience
Statistical Analysis using SPSS
21 HoursThis instructor-led, live training in Indonesia (online or onsite) is aimed at beginner-level to intermediate-level professionals who wish to perform statistical analysis using SPSS to interpret data accurately, run complex statistical tests, and generate meaningful insights.
By the end of this training, participants will be able to:
- Navigate the SPSS interface and manage datasets efficiently.
- Perform descriptive and inferential statistical analyses.
- Conduct t-tests, ANOVA, MANOVA, regression, and correlation analyses.
- Apply non-parametric tests, principal component analysis, and factor analysis for advanced data interpretation.
Talent Acquisition Analytics
14 HoursThis instructor-led, live training (online or onsite) is aimed at HR professionals and recruitment specialists who wish to use analytical methods improve organisational performance. This course covers qualitative as well as quantitative, empirical and statistical approaches.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Introduction to Data Visualization with Tidyverse and R
7 HoursThe Tidyverse is a collection of versatile R packages for cleaning, processing, modeling, and visualizing data. Some of the packages included are: ggplot2, dplyr, tidyr, readr, purrr, and tibble.
In this instructor-led, live training, participants will learn how to manipulate and visualize data using the tools included in the Tidyverse.
By the end of this training, participants will be able to:
- Perform data analysis and create appealing visualizations
- Draw useful conclusions from various datasets of sample data
- Filter, sort and summarize data to answer exploratory questions
- Turn processed data into informative line plots, bar plots, histograms
- Import and filter data from diverse data sources, including Excel, CSV, and SPSS files
Audience
- Beginners to the R language
- Beginners to data analysis and data visualization
Format of the course
- Part lecture, part discussion, exercises and heavy hands-on practice