Computer Vision (1)
Generative AI (2)
Machine Learning Basics (18)
Deep Learning (52)
- DL Basics (16)
- DL Architectures (17)
  - Feedforward Network / MLP (2)
  - Sequence models (6)
  - Transformers (9)
- DL Training and Optimization (17)
Natural Language Processing (27)
- NLP Data Preparation (18)
Supervised Learning (115)
- Regression (41)
  - Linear Regression (26)
  - Generalized Linear Models (9)
  - Regularization (6)
- Classification (70)
  - Logistic Regression (10)
  - Support Vector Machine (9)
  - Ensemble Learning (24)
  - Other Classification Models (9)
  - Classification Evaluations (9)
Unsupervised Learning (55)
- Clustering (37)
  - Distance Measures (9)
  - K-Means Clustering (9)
  - Hierarchical Clustering (3)
  - Gaussian Mixture Models (5)
  - Clustering Evaluations (6)
- Dimensionality Reduction (9)
Statistics (34)
Data Preparation (35)
- Feature Engineering (30)
- Sampling Techniques (5)

What is an Outlier?

Updated: March 26, 2023

An outlier is an observation that is located far away relative to the distribution of the remaining observations. In the regression context, the term outlier is usually used in the context of the target variable, where observations far away in the feature space are called leverage or influence points.

Outliers are often identified subjectively, but common heuristics include points beyond 1.5 interquartile ranges from the first and third quartiles, or those a certain number of standard deviations beyond the mean. Outliers can be problematic when they have undue influence on an algorithm’s fit, such as pulling a regression line one way or the other compared to if that observation was not present. They can also indicate issues pertaining to the quality or data generation mechanism, and it is necessary to understand the context of outliers before deciding how to address them.

Author

AIML.com

Help us improve this post by suggesting in comments below:

– modifications to the text, and infographics
– video resources that offer clear explanations for this question
– code snippets and case studies relevant to this concept
– online blogs, and research publications that are a “must read” on this topic

Leave the first comment (Cancel Reply)

You must be logged in to post a comment.

Partner Ad

Join us on:

Find out all the ways that you can

Contribute

Partner Ad

Learn Data Science with Travis - your AI-powered tutor | LearnEngine.com

What is an Outlier?

Author

Leave the first comment (Cancel Reply)

Other Questions in Statistics