Volltext-Downloads (blau) und Frontdoor-Views (grau)
The search result changed since you submitted your search request. Documents might be displayed in a different sort order.
  • search hit 2 of 7
Back to Result List

Data Preprocessing: A New Algorithm for Univariate Imputation Designed Specifically for Industrial Needs

  • Data pre-processing is a key research topic in data mining because it plays a crucial role in improving the accuracy of any data mining algorithm. In most real world cases, a significant amount of the recorded data is found missing due to most diverse errors. This loss of data is nearly always unavoidable. Recovery of missing data plays a vital role in avoiding inaccurate data mining decisions. Most multivariate imputation methods are not compatible to univariate datasets and the traditional univariate imputation techniques become highly biased as the missing data gap increases. With the current technological advancements abundant data is being captured every second. Hence, we intend to develop a new algorithm that enables maximum utilization of the available big datasets for imputation. In this paper, we present a Seasonal and Trend decomposition using Loess (STL) based Seasonal Moving Window Algorithm, which is capable of handling patterns with trend as well as cyclic characteristics. We show that the algorithm is highly suitable for pre-processing of large datasets.

Download full text files

Export metadata

Additional Services

Search Google Scholar

Statistics

frontdoor_oas
Metadaten
Author:Sowmya Chandrasekaran, Martin Zaefferer, Steffen Moritz, Jörg Stork, Martina Friese, Andreas Fischbach, Thomas Bartz-BeielsteinGND
URN:urn:nbn:de:hbz:832-cos4-4331
Series (Serial Number):CIplus (7/2016)
Document Type:Working Paper
Language:English
Year of Completion:2016
Release Date:2016/11/28
Tag:Imputation; Time Series; Univariate Data
Page Number:24
Institutes and Central Facilities:Fakultät für Informatik und Ingenieurwissenschaften (F10)
CCS-Classification:G. Mathematics of Computing / G.1 NUMERICAL ANALYSIS / G.1.2 Approximation
Dewey Decimal Classification:000 Allgemeines, Informatik, Informationswissenschaft / 004 Informatik
Open Access:Open Access
Licence (German):License LogoCreative Commons - Namensnennung, Nicht kommerziell, Keine Bearbeitung