Machine Learning Concepts - Knowledge Base

What is Machine Learning?

Machine Learning (ML) is a subset of artificial intelligence (AI) that focuses on enabling systems to learn from data without being explicitly programmed. Instead of following predefined instructions, ML algorithms identify patterns, make predictions, and improve their performance over time through exposure to more data.

It has revolutionized various industries, from recommendation systems and image recognition to natural language processing and autonomous vehicles.

A simplified representation of the ML workflow.

Supervised Learning

In supervised learning, algorithms are trained on a labeled dataset. This means that for each input data point, there is a corresponding correct output or "label". The goal is to learn a mapping function from inputs to outputs.

Types of Supervised Learning:

Classification

Used to predict a categorical label. Examples include spam detection (spam/not spam) or image recognition (cat/dog).

Regression

Used to predict a continuous value. Examples include predicting house prices or stock market trends.

Common Algorithms:

Linear Regression
Logistic Regression
Support Vector Machines (SVM)
Decision Trees
Random Forests
K-Nearest Neighbors (KNN)

Unsupervised Learning

Unsupervised learning deals with unlabeled data. The algorithm is tasked with finding hidden patterns, structures, or relationships within the data without any prior knowledge of the correct output.

Types of Unsupervised Learning:

Clustering

Grouping data points into clusters based on their similarity. Examples include customer segmentation or anomaly detection.

Dimensionality Reduction

Reducing the number of features (variables) in a dataset while preserving important information. Examples include Principal Component Analysis (PCA) for simplifying complex data.

Association Rule Learning

Discovering relationships between variables in large datasets. A classic example is market basket analysis ("people who buy bread also tend to buy milk").

Common Algorithms:

K-Means Clustering
Hierarchical Clustering
Principal Component Analysis (PCA)
Apriori Algorithm

Reinforcement Learning

Reinforcement learning (RL) is a type of ML where an agent learns to make decisions by performing actions in an environment to maximize a cumulative reward. The agent learns through trial and error, receiving positive rewards for good actions and negative rewards (penalties) for bad ones.

RL is often used in robotics, game playing (like AlphaGo), and autonomous navigation.

The agent-environment interaction loop in Reinforcement Learning.

Deep Learning

Deep Learning is a subfield of Machine Learning that uses artificial neural networks with multiple layers (hence "deep"). These deep neural networks are capable of learning complex patterns and representations directly from raw data, often achieving state-of-the-art results in areas like image and speech recognition.

Key components include:

Neural Networks: Inspired by the structure of the human brain.
Layers: Input, hidden, and output layers, where each layer processes information.
Activation Functions: Introduce non-linearity, allowing networks to learn complex relationships.
Backpropagation: An algorithm used to train neural networks by adjusting weights based on errors.

Convolutional Neural Networks (CNNs)

Excellent for image processing tasks.

Recurrent Neural Networks (RNNs)

Ideal for sequential data like text and time series.

Key Terms & Concepts

Features

The individual measurable properties or characteristics of a phenomenon being observed. In a dataset, these are typically the columns.

Labels (Target Variable)

The output variable that we are trying to predict in a supervised learning problem.

Training Data

The dataset used to train the ML model. It includes features and corresponding labels (for supervised learning).

Testing Data

A separate dataset used to evaluate the performance of the trained model. It should not have been seen by the model during training.

Model Evaluation

The process of assessing how well a trained model performs on unseen data. Metrics vary depending on the task (e.g., accuracy, precision, recall, RMSE).

Overfitting

When a model learns the training data too well, including its noise, leading to poor performance on new data.

Underfitting

When a model is too simple to capture the underlying patterns in the data, resulting in poor performance on both training and testing data.

Hyperparameters

Parameters that are set before the learning process begins and control the learning process itself (e.g., learning rate, number of layers in a neural network).