One-Day Course
Date to be announced; 8:30am – 5:00pm
Dr. Neal Gallagher, Eigenvector Research, Wenatchee, WA
COURSE DESCRIPTION
Chemometrics without Equations (CWE) is designed for those who wish to explore the problem-solving power of machine learning tools without high level mathematics. Course emphasis is on proper development and interpretation of models as applied to real-life problems. The objective is to teach in a simple way so that participants will be better chemical data science practitioners and managers.
CWE, introduces Principal Components Analysis used for exploratory data analysis/pattern recognition, and regression methods Classical Least Squares and Partial Least Squares regression. Classification methods SIMCA and PLS Discriminant Analysis are also discussed. The course concludes with methods for model improvement with advanced preprocessing and variable selection.
WHO SHOULD ATTEND
Chemometrics without Equations was developed for chemists and bio-scientists who need to analyze multivariate chemical data and for those who manage these staff members. It covers the basics of pattern recognition, quantitative (regression) and qualitative (classification) models. Additional sections discuss how to improve models through advanced data preprocessing and variable selection. The course is relevant in a large number of analytical applications including process analysis, analytical chemistry, pharmaceuticals, sensory, medical devices, metabolomics, etc.
TOPICS
- Introduction
- What is chemometrics?
- Resources
- Pattern Recognition Motivation
- What is pattern recognition?
- Relevant measurements
- Principal Components Analysis
- What is PCA?
- Scores and loadings
- Interpretation
- Supervised and unsupervised pattern recognition
- Regression
- What is regression?
- Classical least squares (CLS)
- Inverse least squares (ILS)
- Principal components regression (PCR)
- Partial least squares regression (PLS)
- Classification
- What is classification?
- Classification based on PCA models: SIMCA
- Using regression for classification: PLS Discriminant Analysis (PLS-DA)
- Advanced Preprocessing
- What are the goals of preprocessing?
- Mean- and median-centering, autoscaling
- Normalization and standard normal variate
- Savitsky-Golay and filtering
- Generalized least squares weighting clutter suppression(GLS)
- Multiplicative scatter correction (MSC)
- Variable Selection
- Why do variable selection?
- Knowledge based selection
- Model based, e.g. on loadings
- Interval PLS (iPLS)
- Conclusions
ABOUT THE INSTRUCTOR
Dr. Neal B. Gallagher, is Vice President and co-founder of Eigenvector Research, Inc. established January, 1995. Recent research includes novel algorithms for iterative target detection in hyperspectral imaging and shift-invariant tri-linearity (SIT) for modeling hyphenated chromatography data. Neal has been intimately involved in chemometrics consulting, teaching short courses and software development including algorithms for detection, classification and quantification. Specific interests include hyperspectral imaging, process modeling, multi-variate curve resolution and classical least squares modeling.