Instructor: Dr. Nayel Bettache.
Office: Surge B 158, 220 Tower Road, Ithaca, NY 14853.
Email: nayel [dot] bettache [at] cornell [dot] edu
The following schedule is a general outline that we plan to follow. Depending on the pace of the course, some topics may be explored in greater detail, while others might be adjusted or omitted. Assignments are currently planned to be released on Thursdays of the corresponding week, though this is subject to change.
This course provides an introduction to the fundamental concepts and techniques in statistical learning and machine learning, with a focus on understanding the theoretical underpinnings of various machine learning algorithms and their implementation in R (and tentatively in Python).
The lectures for this course will be held on Tuesdays and Thursdays from 11:40am to 12:55pm in Phillips Hall, room 101.
The materials for this class will be uploaded on this page. It is entirely your responsibility to download them as needed. A brief description of these materials follows.
Your grade in this class will be based on homeworks and exams, as detailed below.
You will receive four assignments, which together count for 20% of the grade. The lowest homework score will be dropped, with the remaining three assignments weighted equally.
For students in 5740, you may encounter one or two additional questions per assignment, which are required for 5740 but optional for 3740 (offering bonus points for 3740 students).
Late homework submissions will incur a 20% penalty if submitted within 24 hours past the deadline; submissions beyond that will not be accepted. Solutions will be posted on the course website two days after the submission window closes. Please refer to the Course Schedule above for deadlines.
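As an illustration of the homework rules above, here is a minimal Python sketch of how the homework portion of the grade could be computed. The function name `homework_points`, the 0 to 100 score scale, and the choice to apply the late penalty by multiplying the earned score by 0.8 are all assumptions made for this example; the syllabus does not specify how the 20% penalty is applied, so treat the result as an estimate only.

```python
def homework_points(scores, late=None, late_penalty=0.20, weight=20.0):
    """Estimate the homework contribution to the final grade (out of `weight`).

    scores: four homework scores, each assumed to be on a 0-100 scale.
    late:   optional list of four booleans marking submissions made within
            24 hours after the deadline (later submissions are not accepted).
    """
    if len(scores) != 4:
        raise ValueError("expected exactly four homework scores")
    if late is None:
        late = [False] * 4
    if len(late) != 4:
        raise ValueError("expected exactly four late flags")

    # Assumption: the penalty removes 20% of the earned score.
    adjusted = [
        s * (1 - late_penalty) if is_late else s
        for s, is_late in zip(scores, late)
    ]

    # Drop the lowest score; the remaining three are weighted equally.
    kept = sorted(adjusted)[1:]
    return weight * (sum(kept) / len(kept)) / 100.0


print(homework_points([80, 90, 100, 70]))
```

With scores of 80, 90, 100, and 70, the 70 is dropped and the remaining average of 90 corresponds to 18 of the 20 available points.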
There will be three in-class tests, each held during regular lecture time, which together account for 50% of your final grade. The lowest test score will be dropped. For 5740, the two remaining tests carry equal weight. For 3740, the lower of the two remaining scores is weighted 1/3 and the higher is weighted 2/3.
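The test weighting differs between the two course numbers, so a short worked sketch may help. The Python function below is only an illustration: the name `exam_points`, the 0 to 100 score scale, and the string course labels are assumptions introduced for this example, not part of any official grading tool.

```python
def exam_points(scores, course, weight=50.0):
    """Estimate the test contribution to the final grade (out of `weight`).

    scores: three test scores, each assumed to be on a 0-100 scale.
    course: "3740" or "5740".
    """
    if len(scores) != 3:
        raise ValueError("expected exactly three test scores")

    # Drop the lowest score; keep the lower and higher of the remaining two.
    lower, higher = sorted(scores)[1:]

    if course == "5740":
        # Both remaining tests carry equal weight.
        combined = (lower + higher) / 2
    elif course == "3740":
        # Lower remaining score counts 1/3, best score counts 2/3.
        combined = lower / 3 + 2 * higher / 3
    else:
        raise ValueError("course must be '3740' or '5740'")

    return weight * combined / 100.0


print(exam_points([60, 90, 75], "5740"))
print(exam_points([60, 90, 75], "3740"))
```

For scores of 60, 90, and 75, the 60 is dropped. A 5740 student averages 75 and 90 to get 82.5, which corresponds to 41.25 of the 50 available points, while a 3740 student gets 75/3 + 2(90)/3 = 85, which corresponds to 42.5 points.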
Each test will cover the material discussed in class up to the exam date, including problems solved in lectures and all the homeworks due before the exam. I will provide an overview of the exam in class and post a detailed outline of the required materials before each test.
All exams are closed-book, and the use of any electronic devices is strictly prohibited. This includes computers, calculators, cellphones, and other electronic gadgets.
Students with approved extended time: please see the section on accommodations below.
The final project for this course will be a take-home data analysis assignment, designed to be completed at the end of the semester. The project will require students to work in groups, and the datasets along with specific questions for analysis will be distributed around October 20th. Students are expected to form groups of 3 to 4 members. These groups should be finalized and approved by the instructor no later than November 1st. Any student who has not joined a group by this deadline will be assigned to a group by the instructor.
The final report, which documents the results of your analysis, must be submitted as a PDF file by December 16th. If the report is submitted late, a 20% penalty will be applied if it is received within 24 hours after the deadline; reports submitted after this period will not be accepted. The report should be no longer than 8 pages, formatted in a standard style with a font size of 12. It should demonstrate the application of appropriate methods discussed throughout the course, present findings clearly, and provide accurate interpretations of the results.
All data analysis must be conducted using R or Python, and the scripts used in your analysis must be submitted alongside the report, although these scripts will not count towards the 8-page limit. Your project will be graded based on the effective application of the appropriate methods, the clarity and organization of the report, the accuracy of the interpretations, and the reproducibility of your analysis using the provided scripts.
This project is an integral part of the course and will allow you to apply the knowledge and skills you have developed throughout the semester in a practical and meaningful way. It is an opportunity to demonstrate your understanding of the course material and your ability to conduct and present a thorough data analysis.
There is no curving of grades in this class. Your final grade will be based entirely on your performance.
Students with disabilities are encouraged to engage fully in this course, and your access needs are a priority. To ensure that your approved accommodations are arranged in a timely manner, you must request your accommodation letter via the SDS Student Portal by August 31st.
If you are already registered with Student Disability Services (SDS), please note that once you request your accommodation letter, it may take up to 48 hours for the letter to be processed and sent to me. If you are not yet registered with SDS, be aware that the process to register and receive new accommodations can take up to three weeks. Once approved, you will be able to request your accommodation letter for this course.
If you are approved for accommodations later in the semester, it is important that you request your accommodation letter as soon as possible to avoid any delays in receiving the necessary support.
Regarding exam accommodations, this course is participating in the Alternative Testing Program (ATP). All exams will be centrally managed by the ATP, and relevant information will be communicated through SDS-testing@cornell.edu and your SDS Student Portal. It is important to stay informed by reading these communications and visiting sds.cornell.edu/atp for additional details about the ATP process.
Starting in Fall 2023, students no longer need to request each individual exam. However, if you have an academic conflict with a scheduled exam time, you must submit an “exam request form” in the SDS Student Portal. All requests for conflict exams must be submitted no later than 10 business days prior to the exam date, and conflict exams will be scheduled at standard times.
For all relevant information and to manage your accommodations, please visit the SDS Student Portal at sds.cornell.edu.
Course materials provided in this class are the intellectual property of the instructor. Students are strictly prohibited from buying, selling, or distributing any course materials without the express permission of the instructor. Engaging in such unauthorized activities is considered academic misconduct and will be treated accordingly.
Every student in this course is expected to adhere to the Cornell University Code of Academic Integrity. All work submitted for academic credit must be the student’s own original work. The use of AI resources, including tools like ChatGPT, is strictly prohibited in this class.
The material provided below has been thoughtfully compiled by students from the Body Positive Cornell organization. It offers a well-researched and comprehensive list of well-being resources available on campus. For detailed information and guidance, please refer to the following resource: