layout | title | permalink |
---|---|---|
page | Schedule | /schedule/ |
The schedule is organized by day: M means Monday, T means Tuesday, and R means Thursday. We may not cover every topic on its scheduled day. If we do not, the remaining content will be made available online, with a note stating whether it will be on the midterm.
Note: All homework is due on Sunday at 11:59 PM CDT by e-mail to balamut2 [AT] illinois [dot] edu.
- 6/13/16 - M: Course Introduction, Review of STAT 200, Compiler vs. Interpreter
- 6/14/16 - T: Reproducible Research using knitr, markdown, rmarkdown, git, and GitHub
- 6/16/16 - R: Variables, Data Types, Data Structures, and Vectors
- Read Data structures and Subsetting in Advanced R
- Additional Reading:
- 6/20/16 - M: Grouping 101, Control Statements, Loops, Big O
- Read Iteration in R for Data Science
- Read Circle 2: Growing Objects in R Inferno
- 6/21/16 - T: Functions, Recursion, Memoization, and Benchmarking
- >> You must be in a group by now! <<
- Read Functions in R for Data Science
- Read Performance chapter until "Language performance" in Advanced R
- Read Memoization within Functional Operators in Advanced R
- Read Circle 5: Not Writing Functions in R Inferno
- 6/23/16 - R: Numerical Stability, Errors and Debugging Techniques (A Study in Calculating Variance)
- Read Debugging, condition handling, and defensive programming in Advanced R
- Read Circle 1: Falling into the Floating Point Trap in R Inferno
- Read the introduction to Circle 8 in R Inferno
- Create an account on StackOverflow and look at the R tag questions. Can you answer some? Send me a tweet @axiomsofxyz if you do!
- 6/27/16 - M: Advanced Data Structures, Memory Usage and Profiling
- Read Profiling and Memory in Advanced R
- 6/28/16 - T: Data Management Techniques, Regular Expressions (regex), Dates and Times
- Read Strings and Dates and Times in R for Data Science
- Read the Tidy Data paper in Journal of Statistical Software by Hadley Wickham
- 6/30/16 - R: S3 Objects and Ellipses (A study in Linear Regression)
- 7/01/16 - F: >> Submit Group Proposal! <<
- 7/04/16 - M: No Class for Independence Day
- 7/05/16 - T: Group Project Proposals / Review Day
- 7/07/16 - R: Midterm
- Midterm Concept List
- Sample Midterm
- Sample Midterm Solutions (Done in class...) (R Code in Class)
- 7/11/16 - M: Pipes and Web Scraping
- Read Pipes in R for Data Science
- Read SelectorGadget vignette in rvest
- 7/12/16 - T: Exploratory Data Analysis (EDA) - Quantitative
- 7/14/16 - R: Exploratory Data Analysis (EDA) - Visual
- Read Data Visualization
- 7/18/16 - M: Shiny
- 7/19/16 - T:
- >> Submit Group Progress Update! <<
- 7/21/16 - R: Package Development (A study in contributing)
- Read Package Structure, Object Documentation and Testing in R Packages
- 7/25/16 - M: Welcome to Rcpp Country
- 7/26/16 - T: Parallel Computing with R
- 7/28/16 - R: Parallel Computing with Rcpp
- 8/01/16 - M: Unix Shell
- 8/02/16 - T: Distributed Computing with the Illinois Campus Cluster (ICC)
- Look at the Beginner's Guide
- Read Working with R on a Cluster
- Today is Group Presentation day!
- 8/04/16 - R: Reading Day (No Class!)
Final Reports are due on Sunday, August 7th, 2016 at 11:59 PM CDT.
Here are a few topics I would encourage you to look into more on your own:
- S4 and RC Objects (A study in probability distributions)
- The Apply Family (A study in Efficient Vectorization)
- Read Circle 4: Over-Vectorizing in R Inferno