MindMap Gallery data mining
This mind map summarizes the core topics of data mining: an introduction to data mining, data preprocessing, Bayesian theory, decision tree classifiers, neural networks, and support vector machines.
Edited at 2021-12-20 22:24:31
data mining
CH1 Introduction to Data Mining
Data mining, machine learning and artificial intelligence
Machine learning framework
Algorithm model design choices
Training set
test set
Machine learning scenarios
supervised learning
semi-supervised learning
transfer learning
unsupervised learning
reinforcement learning
Machine learning tasks
Regression
Classification
Structured learning
machine learning algorithm
Linear
nonlinear
deep learning
Support Vector Machines
decision tree
K-nearest neighbors (KNN)
Introduction to learning materials and data mining
data
data attributes
Qualitative attributes and quantitative attributes
Discrete and continuous attributes
Data storage and issues
storage
physical type
logical type
Data preprocessing
data integration
Data cleaning
Data reduction
data conversion
Common problems and methods in data mining
data mining technology
Classification
confusion matrix
ROC curve
ROC curve drawing
ROC curve application
AUC value
clustering
distance measure
algorithm
application
Association rules
Regression
Data preprocessing (pre)
Data cleaning
Data Integration
data conversion
data reduction
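The confusion matrix and AUC value listed under Classification can be sketched in a few lines. This is a minimal illustration, assuming binary labels coded as 1 (positive) and 0 (negative); the function names and the sample data are hypothetical and not taken from the map. The AUC here uses the pairwise-ranking definition: the fraction of (positive, negative) pairs in which the positive example gets the higher score, with ties counted as one half.

```python
def confusion_counts(y_true, y_pred):
    """Return (TP, FP, FN, TN) for binary labels coded as 1/0."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn


def auc(y_true, scores):
    """Area under the ROC curve via pairwise ranking of positives vs. negatives."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical data: four examples with classifier scores.
y_true = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.6, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # threshold chosen for illustration

print(confusion_counts(y_true, y_pred))
print(auc(y_true, scores))
```

Sweeping the threshold and plotting the true positive rate against the false positive rate at each step would trace the ROC curve itself; the pairwise form gives the same area without drawing it.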
CH2 data preprocessing
Missing, outlier, duplicate
Handle missing data
outliers
local outlier factor
Data cleaning
data conversion
Data description
Feature selection
Feature extraction
Data conversion and description
data conversion
Attribute categories
type conversion
sampling
Imbalanced data set
upsample
edge sampling
standardization
Data description
Basic description
Correlation coefficient
Pearson product-moment correlation coefficient
Pearson chi-square test
Feature selection and feature extraction
entropy
Information content (self-information)
information gain
Principal component analysis (PCA)
Linear discriminant analysis (LDA)
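Entropy and information gain, listed under feature selection, can be illustrated as follows. This is a sketch under stated assumptions: the attribute is categorical, entropy is measured in bits (base-2 logarithm), and the function names and toy data are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy H(Y) in bits of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(values, labels):
    """H(Y) minus the weighted entropy of Y within each attribute value."""
    n = len(labels)
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(v, []).append(y)
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Hypothetical toy data: one attribute separates the classes, the other does not.
labels = ["yes", "yes", "no", "no"]
informative = ["a", "a", "b", "b"]
uninformative = ["a", "b", "a", "b"]

print(information_gain(informative, labels))
print(information_gain(uninformative, labels))
```

An attribute with a higher information gain reduces the uncertainty about the class label more, which is why it is a candidate for selection.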
CH3 Bayesian Theory & Decision Tree Classifier
Naive Bayes
Bayesian theory
Example: Checking for Cancer
Naive Bayes classifier
Conditional independence
Laplace smoothing
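The cancer-screening example and the smoothing step can be sketched numerically. The map gives no figures, so the prior, sensitivity, and false positive rate below are hypothetical values chosen only for illustration, and the function names are assumptions as well.

```python
def posterior(prior, sensitivity, false_positive_rate):
    """Bayes' theorem: P(disease | positive test)."""
    evidence = sensitivity * prior + false_positive_rate * (1.0 - prior)
    return sensitivity * prior / evidence


def smoothed_probability(count, total, num_values, alpha=1.0):
    """Additive smoothing of P(x | class); alpha = 1 is the usual Laplace choice."""
    return (count + alpha) / (total + alpha * num_values)


# Hypothetical numbers: 1% prevalence, 90% sensitivity, 10% false positives.
print(posterior(prior=0.01, sensitivity=0.9, false_positive_rate=0.1))

# An attribute value never seen with this class still gets a nonzero probability.
print(smoothed_probability(count=0, total=10, num_values=3))
```

With these assumed inputs the posterior is 0.009 / 0.108, about 8.3%, which shows how a low prior keeps the posterior small even when the test is fairly accurate. Smoothing matters for the naive Bayes classifier because a single zero conditional probability would otherwise zero out the whole product.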
decision tree
ID3
entropy
Attribute selection
ID3 framework
Decision tree pruning classification
Entropy deviation
CART decision tree
CART classification tree
Gini index
CART regression tree
optimal split point
pruning
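The Gini index and the search for an optimal split point in a CART classification tree can be sketched as below. This is a simplified illustration for a single numeric feature, assuming candidate thresholds are midpoints between consecutive distinct sorted values; the function names and data are hypothetical, and pruning is not covered.

```python
def gini(labels):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = {}
    for y in labels:
        counts[y] = counts.get(y, 0) + 1
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def best_split(xs, ys):
    """Return (threshold, weighted Gini) of the best binary split on one feature."""
    pairs = sorted(zip(xs, ys))
    n = len(pairs)
    best_threshold, best_score = None, float("inf")
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold can separate equal feature values
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        score = len(left) / n * gini(left) + len(right) / n * gini(right)
        if score < best_score:
            best_threshold, best_score = threshold, score
    return best_threshold, best_score


# Hypothetical data: the classes separate cleanly between x = 2 and x = 3.
print(best_split([1, 2, 3, 4], [0, 0, 1, 1]))
```

A CART regression tree follows the same search but scores each candidate split by the squared error of the two sides instead of the Gini index.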
CH4 Neural Network
perceptron
For linearly separable data sets
AND and OR gates
gradient descent method
Loss function derivation
Least squares loss function
NAND gate
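Because the logic gates above are linearly separable, a single perceptron can learn them. The sketch below uses the classic perceptron update rule (weights change only on misclassified examples) rather than gradient descent on a least squares loss; the function names, the unit learning rate, and the epoch cap are assumptions made for illustration.

```python
def predict(weights, bias, x):
    """Step activation: output 1 if the weighted sum is positive, else 0."""
    total = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if total > 0 else 0


def train_perceptron(samples, targets, lr=1, max_epochs=100):
    """Perceptron rule: w += lr * (target - prediction) * x on each mistake."""
    weights = [0] * len(samples[0])
    bias = 0
    for _ in range(max_epochs):
        mistakes = 0
        for x, t in zip(samples, targets):
            error = t - predict(weights, bias, x)
            if error != 0:
                weights = [w + lr * error * xi for w, xi in zip(weights, x)]
                bias += lr * error
                mistakes += 1
        if mistakes == 0:  # every example classified correctly
            break
    return weights, bias


inputs = [(0, 0), (0, 1), (1, 0), (1, 1)]
and_targets = [0, 0, 0, 1]
nand_targets = [1, 1, 1, 0]

w, b = train_perceptron(inputs, and_targets)
print([predict(w, b, x) for x in inputs])
```

The same routine learns NAND by swapping in `nand_targets`. It would not converge on XOR, which is not linearly separable and needs a multilayer perceptron.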
Other linear models
linear regression
Empirical error
Parameter calculation
Regularization term
Logistic function
Logistic curve
Logistic distribution
logistic regression
likelihood function
maximum likelihood estimation
Cross entropy & KL divergence
Multi-class classification problem
Softmax regression
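The link between softmax, cross entropy, and KL divergence can be made concrete with a short sketch. It assumes natural logarithms and discrete distributions given as lists of probabilities; the function names and sample values are hypothetical. It relies on the identity KL(p || q) = H(p, q) - H(p).

```python
import math


def softmax(logits):
    """Map raw scores to probabilities; subtracting the max avoids overflow."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(p, q):
    """H(p, q) = -sum p_i * log q_i, skipping terms where p_i is zero."""
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q) if pi > 0)


def kl_divergence(p, q):
    """KL(p || q) = H(p, q) - H(p), where H(p) = H(p, p)."""
    return cross_entropy(p, q) - cross_entropy(p, p)


# Hypothetical three-class example with a one-hot target.
probs = softmax([2.0, 1.0, 0.1])
target = [1.0, 0.0, 0.0]

print(probs)
print(cross_entropy(target, probs))
print(kl_divergence(target, probs))
```

For a one-hot target the entropy H(p) is zero, so cross entropy and KL divergence coincide. That is why minimizing the cross-entropy loss in softmax regression is equivalent to minimizing the KL divergence from the predicted distribution to the labels.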
multilayer perceptron
CNN-RNN-Transformers
CH5 support vector machine