Machine Learning and ML Patterns

Machine learning is to program computers to improve a performance standard using example data or experience. Learning is used when humans are unable to explain their expertise (speech recognition), Solution changes in time (routing on a computer network), Solution needs to be adapted to particular cases (user biometrics)

What we’re talking about is “learning”:

Given a Dataset D, a task T, and a performance measure M, a computer system is said to learn D to run the task T if, after learning the system performance on T progresses as measured by M. In other words, the learned model supports the system to perform T better than no learning.

A classic Example

Recognizing text may be tough like S & 5 similarly O & 0.

An Example Application

The emergency room of the hospital measures n the number of variables (e.g. blood pressure, age, etc.) of the newly-received patients. A decision should be taken if the new patient is to be included in the intensive care unit. Due to the high cost of the ICU, patients who can survive less than a month will receive a higher priority.

Problem: Predict high-risk patients and discriminate against low-risk patients.

Another Example

The credit card company receives thousands of requests for new cards. Each application contains information about the candidate age, marital status, annual salary, outstanding debts, punctuation,

Problem: Deciding to approve or classify the application in two categories, approving it and not accepting it.

More Examples

Recognize patterns: Facial identities or facial expressions, handwritten or spoken words, medical images.

Model generation: Image generation or motion sequences.

Recognize anomalies: Unusual patterns of sensor readings in a nuclear power plant or an unusual sound in the car’s engine.

Prediction: Future stock prices or exchange rates

Some Web Examples

The web contains a lot of data. Tasks with very large datasets often use machine learning, especially if the data is noisy or unstable.

Spam filtering, fraud detection: The enemy adapts, we must adapt to it as well.

Recommendation systems: lots of noisy data. Recommend valued data.

Request Information: Search for documents or images with similar content.

Data visualization: Displaying a massive database in the revealing form.

Learning Algorithm

Learning: Learn the model using training data.

Testing: Test the model using unnoticed test data to evaluate model accuracy.

Learning Patterns

Supervised Learning
- Classification
- Regression
Unsupervised Learning
- Clustering
Reinforcement Learning
- Associative RL
- Non-Associative RL

Supervised Learning

Classification

Predicts categorical class labels. Classifies data based on the training set and the values (class labels) in a classifying attribute and uses it in classifying new data. Credit scoring. The separation between low-risk and high-risk clients from their income and savings.

Classification Applications

Pattern recognition
Face detection and recognition
Character recognition
Speech recognition
Sensor fusion: Associate various modalities; e.g., visual (lip image) and sound for speech
Medical diagnosis

Regression

Example: Price of a used car

x: car attributes

y: pricey = g (x | θ ) g ( ) model, θ parameters

Regression Applications

Navigating a car: Angle of the steering wheel & Kinematics of a robot arm

α1= g1(x,y)

α2= g2(x,y)

Prediction of future cases
Use the regulation to guess the output for upcoming inputs
Knowledge extraction
Learning a rule from data
Compression
Finding a rule simpler than the data it explains
Outlier detection
Exclusions that are not enclosed by the rule, e.g., fraud

Unsupervised Learning

Learning “what normally happens”, No predefined output

Clustering: Grouping similar instances

Example applications

Customer segmentation in CRM
Image compression: Colour quantization
Bioinformatics: Learning motifs

Clustering

Clustering is a methodology for finding similar collections of data that are called clusters. Groups data instances that are similar to each other (closed) in a cluster and very different (remote) data instances in one of the other different clusters. Clustering is an unsupervised learning task because there are no class values that represent a particular group of examined data instances. which in case of supervised learning.

Illustration

Data collection consists of three natural groups of data points, namely 3 natural clusters.

What is clustering for?

Let us understand some real-life examples

Example: Groups of people of similar size who collect T-shirts “small “, “medium” and “big “. Tailor-made for each person: too expensive. One size: not suitable for everyone.

Example: In marketing, fragment customers by their similarities, to do directed marketing.

Example: About the collection of text documents, we want to organize them according to their content representations, which produce a hierarchy of topics. Clustering is one of the most necessary data mining techniques. It has a long history and is used in almost all areas, for example, medicine, psychology, sociology, botany, biology, marketing, insurance, libraries, etc.

In recent years, due to an accelerated increase in online documents, the text clustering becomes essential.

ASPECTS OF CLUSTERING

A clustering algorithm:

Partitioning clustering

Hierarchical clustering

A distance (similarity, or dissimilarity) function
Clustering quality: The quality of a clustering result depends on the algorithm, the distance function, and the application.

Reinforcement Learning

Reinforcement learning is supervised learning in which limited information of the desired outputs is known but Complete knowledge of the environment is not available; only basic benefit or reward information. In other words, a critic instead of a teacher leads the learning process. Reinforcement learning has roots in experimental studies of animals to learn, teaching a dog through positive (“Good dog “, something to eat) and negative (“Bad Dog “, nothing to eat) reinforcement.

Associative

Associating action and stimuli. In other words, an incentive to draw action to develop the reinforcement information received from the environment.

Non-associative

Choose an action instead of actions associated with stimuli. The only input acknowledged from the surroundings is reinforcement information. Examples include genetic algorithms

The reinforcement signal can be any signal that evaluates the actions of the learning system, not only a sign of success/failure that it takes in real values often, and the learning objective is to maximize its expected value. The critic does not tell the learning system directly on how to change his actions. Reinforcement learning algorithms are selection processes. There must be a variation in the process of generating actions so that the effects of the alternative actions can be compared with the best option.

Problem Solving