Upcoming on Fall 2024 – CPSC 541- Regression Analysis
This course covers routinely used regression-based statistical analytical techniques with a focus on agricultural applications.
Note that it has been several years since this course was last taught, and it will have major updates to the syllabus and the code works.
This year, we will leverage PraireLearn and Jupyter notebooks for coding exercises, allowing both R and Python for analytics.
Although most of the techniques and topics covered here can be encountered in various other stats courses offered across campus, this course focuses on understanding and working with real-world data, specifically teaching how to understand and pick analytical methods to ask and answer your own research questions with credibility.
Topics & Techniques Covered
Non-exhaustive
- Sampling Distributions
- Common families of Statistical Distributions
- Linear Regression
- Logistic Regression
- Non-linear Regression
- Curve Fitting
- Variance and Variance Components
- Sources of Error
- Goodness-Of-Fit
- Bootstrapping and Cross Validation
- Ensemble Methods
- Decision Trees
- Random Forest
- Gradient Descent
- Regularization Techniques
- Interpolation
Tools and Platforms
Basic Python – libraries
- Pandas
- Numpy
- Scikit-learn
- SciPy
- Seaborn
- Matplotlib
Project Management Basics & Platforms
- Google Sheets
- Monday.com
- SmartSheets
Collaborative Computing, Version Control & IDEs
- Github/GitLab
- Colab
- Jupyter
- PyCharm / IntelliJ / VirtualStudio
- R-Studio