Last updated
Kurskode
dataminr
Varighet
14 timer (vanligvis 2 dag inkludert pauser)
Krav
Good R knowledge.
Oversikt
R er et gratis programmeringsspråk med åpen kildekode for statistisk databehandling, dataanalyse og grafikk. R brukes av et økende antall ledere og dataanalytikere i selskaper og akademia. R har et bredt utvalg av pakker for data mining.
Machine Translated
Kursplan
Sources of methods
- Artificial intelligence
- Machine learning
- Statistics
- Sources of data
Pre processing of data
- Data Import/Export
- Data Exploration and Visualization
- Dimensionality Reduction
- Dealing with missing values
- R Packages
Data mining main tasks
- Automatic or semi-automatic analysis of large quantities of data
- Extracting previously unknown interesting patterns
- groups of data records (cluster analysis)
- unusual records (anomaly detection)
- dependencies (association rule mining)
Data mining
- Anomaly detection (Outlier/change/deviation detection)
- Association rule learning (Dependency modeling)
- Clustering
- Classification
- Regression
- Summarization
- Frequent Pattern Mining
- Text Mining
- Decision Trees
- Regression
- Neural Networks
- Sequence Mining
- Frequent Pattern Mining
Data dredging, data fishing, data snooping