data-processing

Blog Cover Image
Anomaly Detection in Machine Learning

Anomaly detection is a process of finding samples behaving abnormally compared to the majority of samples present in the dataset. Anomaly detection algorithms have important use-cases in Data Analytics and Data Science fields. For example, fraud analysts rely on anomaly detection algorithms to detect fraud in transactions.

Blog Cover Image
Need of Feature Scaling in Machine Learning

In this article, we will learn about one of the essential topics used in scaling different attributes for machine learning: Normalization and Standardization. Normalization and Standardization are the techniques used to scale all the features in the same range. It avoids the cases of biases on higher or lower magnitude features.

Blog Cover Image
Hands-on Methods for Data Pre-Processing of Structured Data

In this blog, we will do hands-on on several data preprocessing techniques in machine learning like Feature Selection, Feature Quality Assessment, Feature Sampling, and Feature Reduction. We will use different datasets for demonstration and briefly discuss the intuition behind the methods.

Blog Cover Image
Pre-processing of Time Series Data In Machine Learning

Time series data is found everywhere, and to perform the time series analysis, we must preprocess the data first. Time Series preprocessing techniques have a significant influence on data modelling accuracy.

Blog Cover Image
Word Vector Encoding: Make Machines Understand Text

Computers only understand numbers, not text. So we need to convert our text into vectors using vector encoding.

Blog Cover Image
Pre-processing of Text Data in Machine Learning Part 1

Text data pre-processing ensures optimal results when executed properly. Fortunately, Python has excellent support of NLP libraries such as NLTK, spaCy, and Gensim to ease our text analysis.

Our weekly newsletter

Subscribe to get free weekly content on data structure and algorithms, machine learning, system design, oops design and mathematics.