Data Preparation

  • Class: 30 hours
  • Practice: 30 hours
  • Independent work: 120 hours
Total: 180 hours

Course title

Data Preparation

Lecture type

Obligatory

Course code

20-02-018

Semester

1

ECTS

6

Lecturers and associates

Course objectives

High-quality data analysis results require well-prepared input data. The aim of the course is to demonstrate basic methods of data preparation, including cleaning, transforming, integrating, normalizing and aggregating data, transforming time series, and working with missing values, as well as basic data reduction methods such as feature reduction, sample reduction, and discretization.

Content

Introduction to data preparation. Data cleaning. Working with missing values. Data transformation. Sample reduction. Data aggregation. Time series transformation. Data integration. Data normalization. Data discretization. Feature reduction. Practice and future. Exam preparation.

Required reading

Course handbook prepared and printed by Algebra University College

Additional reading

1. Salvador García, Julián Luengo, Francisco Herrera: Data Preprocessing in Data Mining (2016)
2. S. Appavu Balamurugan, A.B. Arockia Christopher: Insight into Data Preprocessing: Theory and Practice: Data Mining Perspective (2012)
3. Soumen Chakrabarti, Earl Cox, Eibe Frank, Ralf Hartmut Güting, Jiawei Han, Xia Jiang, Micheline Kamber, Sam S. Lightstone: Data Mining: Know It All (2009)

Minimal learning outcomes

  • Address issues that arise while preparing data.
  • Differentiate methods for working with missing values and methods of data transformation.
  • Differentiate the basic aggregation functions and methods of time series transformation.
  • Differentiate potential problems in the process of data integration, normalization and discretization, and know their potential solutions.
  • Differentiate the basic methods of reducing features and samples.
  • List tools and technologies for data preparation in Big Data environments.

Preferred learning outcomes

  • Recommend solutions for problems encountered while preparing data.
  • Choose an adequate method for working with missing data and an adequate method of data transformation.
  • Select the appropriate aggregation functions and methods of time series transformation.
  • Select an adequate solution for a particular problem in the process of data integration, normalization and discretization.
  • Apply adequate basic methods of reducing features and samples.
  • Understand the impact of new technologies on the process of data preparation.