General References:
Useful References related to various topics:
Tentative Topics
Some combination of the following topics:
(Instructors are Listed in [ ] )
Week 0: June 21/22: [AM + AD]
R Introduction Quiz
Set seed to 100 using set.seed(100) command.
- Randomly generate 100 numbers with a mean 0 and standard deviation 1 and store these numbers in a vector called `two'
- What is the expected value of the generated numbers and the actual mean
- Write a function which takes in a number x and seed y and returns x randomly generated numbers with seed y
- Write a function which returns the mean median and mode in a vector given a vector inputs
- Generate 1000 numbers with a mean 0 and stdev 1, and find how many of them are greater than 0.2 with a single command. Store these as a vector called `six'
- Multiply the vectors `six' by `two' and store results in `tricky' - what is the result and why is there a warning?
- Install the `housingData' package and store the `fipsCounty' data into a data frame
- Which state appears the most times?
- Load the housing data into a dataframe. Which state has the most houses sold, which state has the highest average difference between list and selling price
- Plot a graph of list and selling price and find the outliers in the data. Does there appear to be a relation and is there a similarity between the outliers
Week 1: June 25-29 [AM + AD]
Week 3: July 9-13 [AD]
- R-Studio
- R-Markdown
- Math Operations
- Functions
- Basic Data Structures
- Statistical Features
- Reading and Cleaning Data
- Plotting Data
- Operations on Tables (Select, Filter, Mutate, Sumarize)
- Functions on Vectors and Lists
- Introduction
- Measuring Error
- Gradient Descent
- Regression Package in R
- Correlation
- Linear Model
- Multiple Linear Regression
- Splitting into training testing sets
- Different regression models, generating features, choosing features
- Different types of regressions and measuring error
- Problems and Drawbacks with Regression
Logistic Regression Decision Trees
- Multiple Regression
- Splitting
- Cross Validation
- Overfitting
Week 4: July 16-19 [SM +
AD]
Members |
Topic |
|
1 |
Vedang, Tirth, Pranjal |
Face Recognition using IRIS Scan How to match the person in the database given the measurements from the IRIS scan |
2 |
Bhaummi, Kajol, Vinita |
How find find the shops in geometric vicinity
which match the given criteria of the user. |
3 |
Sanket, Nidhi, Shivam |
Fingerprint Recognition via the LSH scheme
and studying the effect of varying the cell numbers for designing the hash functions. |
4. |
Vrajesh, Shruti, Aarsh |
Finding where to shop the grocery given the
database of barcodes of produce with price, deals,
expiry, etc. |
5 |
Panth, Ketul, Rushang |
How to analyze the success of business given
their annual reports. |
6 |
Rahul, Smit, Meet |
How to find the best deals from online stores
given multicriteria including price, shipping time, buy
local, etc. |
7 |
Juhi, Divyanshi |
Strategies for enabling remote desktops |