Upcoming in Fall 2024 – CPSC 541: Regression Analysis (CRN: 46812+46819) & CPSC 499: Statistical Learning in Ag (CRN: 79930)
These two courses cover essentially the same Topics and Techniques. However, the 499 version is an accelerated 8-week module intended for CS+ undergraduate majors. It will NOT cover the Tools and Platforms listed below and requires prior knowledge of them.
I would advise chatting with me about your computing skill level and background before you register.
For course and session information, check the links below:
This course covers routinely used regression-based statistical analytical techniques with a focus on agricultural applications.
Note that it has been several years since this course was last taught, so both the syllabus and the code will receive major updates.
This year, we will use PrairieLearn and Jupyter notebooks for coding exercises, with both R and Python allowed for analytics.
Although most of the techniques and topics covered here also appear in other statistics courses offered across campus, this course focuses on understanding and working with real-world data. Specifically, it teaches how to understand and choose analytical methods so that you can ask and answer your own research questions with credibility.
Topics & Techniques Covered
Non-exhaustive
- Sampling Distributions
- Common families of Statistical Distributions
- Linear Regression
- Logistic Regression
- Non-linear Regression
- Curve Fitting
- Variance and Variance Components
- Sources of Error
- Goodness-of-Fit
- Bootstrapping and Cross-Validation
- Ensemble Methods
- Decision Trees
- Random Forest
- Gradient Descent
- Regularization Techniques
- Interpolation
Tools and Platforms
Basic Python – libraries
- Pandas
- NumPy
- Scikit-learn
- SciPy
- Seaborn
- Matplotlib
Project Management Basics & Platforms
- Google Sheets
- Monday.com
- Smartsheet
Collaborative Computing, Version Control & IDEs
- GitHub/GitLab
- Colab
- Jupyter
- PyCharm / IntelliJ / Visual Studio
- RStudio
