PSTAT 5A: Understanding Data

Course Syllabus

Instructor
Quarter
Ethan Marzban

Summer Session A, 2023

WELCOME TO PSTAT 5A! I am very excited to introduce to you the wonderful fields of Statistics and Data Science. As our world becomes ever more saturated with data, the need for data literacy becomes increasingly important. By the end of this course, I hope you will be able to think critically about statistical studies and results, understand how data can be used to simultaneously inform and manipulate, and begin applying your newfound techniques to your future endeavors. I am very much looking forward to a great quarter with all of you!

— Ethan

Lecture Information

Lecture Times and Locations

M, T, W, Th: 11:00am - 12:20pm in ILP 1101

Course Staff

Instructor: Ethan P. Marzban
Email: epmarzban@pstat.ucsb.edu (Please use only in case of emergency)
Help Hours:
  • Tuesdays: 4:30pm - 5:30pm (Zoom)
  • Thursdays: 2:30pm - 3:30pm (SH 5607 F)
TAs
Mengrui Zhang
(mengrui@ucsb.edu)
Olivier Mulkin
(omulkin@ucsb.edu)

Schedule of Sections

Times TA Location
MW 12:30 - 1:20pm Olivier Mulkin PHELPS 1513
MW 2:00 - 2:50pm Mengrui Zhang PHELPS 1513
MW 3:00 - 3:50pm Mengrui Zhang PHELPS 1513

Course Description

The official description of this course, from the Course Catalog, is:

Introduction to data science. Concepts of statistical thinking. Topics include random variables, sampling distributions, hypothesis testing, correlation and regression. Visualizing, analyzing and interpreting real world data using Python. Computing labs required.

Textbook(s)

This quarter, we do not have a required textbook- the lecture slides and lab activities are designed to be self-sufficient. However, the following textbooks are highly recommended:

  • OpenIntro: Statistics. David Diez, Mine Çetinkaya-Rundel, and Christopher D Barr. (free version, courtesy of the authors, available at https://leanpub.com/os)

  • Computational and Inferential Thinking: The Foundations of Data Science. Ani Adhikari and John DeNero. (available at: https://www.inferentialthinking.com)

  • StatClass (2nd Edition, Revised). Dawn E. Holmes and Lubella A. Lenaburg

Course Components

The following are the assignments and metrics that will be used to compute your final grade in this course:

Labs

Every Monday you will work through a Lab worksheet that will be posted to the website by the previous day. You will then have until 40 minutes after the end of your scheduled section to submit your work through Gradescope. Your TA will explain more about the structure of lab during your first Section meeting of the quarter.

Quizzes

Quizzes will be administered asynchronously on Fridays, through Gradescope. Specifically, the quiz will remain open from 10am until 11:59pm, and you must find 40 consecutive minutes to take the quiz. (Consecutive means you cannot start the quiz, and then come back to it later- once you start, you will have 40 minutes to both complete the quiz as well as upload your work). I encourage you to spend 25 minutes working on the quiz, and 15 minutes uploading. No quiz scores will be dropped. There are no quizzes in Exam Weeks.

Exams

There are two midterms and a final exam for this class. You are required to take all three exams; failure to do so will result in an automatic grade of “F”, so please ensure you are able to take the exams on the dates listed below.

Exam Dates:
  • Midterm 1 is scheduled to take place Thursday July 6 from 11:00am - 12:20pm (lecture time)
  • Midterm 2 is scheduled to take place Thursday July 20 from 11:00am - 12:20pm (lecture time)
  • The final is scheduled to take place FRIDAY, AUGUST 4 from 4 - 7pm

All exams are scheduled to take place in our lecture classroom, ILP 1101. Unless stated otherwise, all exams will be cumulative.

Homework:

Homework for this class will be neither collected nor graded. However, the weekly quizzes will be based heavily (though potentially not entirely) off the homework assignments. Homework assignments will be released on Mondays, along with their final answers. Solutions will be posted biweekly, on the Tuesdays before exams. As such, you should not rely on the solutions to help you study; instead, please come to office hours to check your work regularly!

Schedule of Due Dates

A tentative schedule of release and due dates can be found here

Grading Scheme

Your final grade will be computed using the following weights:

Labs: 10%
Quizzes: 15%
Midterm 1 20%
Midterm 2 20%
Final Examination: 35%

Please note that late submissions for any of the above will not be accepted. Additionally, make-up exams cannot be accommodated.

Your final letter grade will be issued according to the following scheme (cutoffs between plusses and minuses will be calculated at the end of the quarter):

  • A – A+: 90 – 100%
  • B – B+: 80 – 89.99%
  • C – C+: 70 – 79.99%
  • D – D+ : 60 – 69.99%
  • F: 0 – 59.99%

I have elected to adopt an uncurved grading scheme to eliminate any sense of “competition” among students; I highly encourage you all to collaborate with and uplift each other. Having said that, I will certainly consider adjusting the cutoffs (naturally, in everyone’s favor) at the end of the quarter if necessary.

Academic Integrity

As a member of the UCSB community, it is expected that you will act with academic integrity. This means, among other things, that the work you submit should be entirely your own and not copied from any external sources. Collaboration on non-quiz and non-exam assignments is perfectly acceptable (and even encouraged), but the work you submit should still be your own; you can’t have someone else write up solutions for you.

Anyone found guilty of academic misconduct will be reported to the Academic Senate, and will receive at minimum a failing grade on the assignment in question; further actions may also include failing the course, and marks being made on permanent records. Depending on the severity of the infraction, expulsion is also a possibility.

Basically, don’t cheat- please! If you’re ever struggling with course material, please come talk to me or the TA’s. We are truly here for you, and want only the best for you.

Intellectual Property

You’ve probably seen a clause on other syllabi stating something to the effect of “all material in this course is the intellectual property of myself and may not be shared with anyone outside this class without my explicit written permission.”

Though this does still hold true for this course, I will be making most course-related material available on a public GitHub site, which can be accessed here: https://pstat5a.github.io.

Disabled Students Program (DSP)

If you have a disability, or otherwise require accommodations for the exams and/or quizzes please reach out to the Disabled Students Program (DSP) ASAP to ensure your request(s) for accommodation can be processed. We ask that all requests be logged at least a week in advance, to ensure the system enough time to process. Please note that we cannot grant any requests for accommodations unless they come to us from DSP directly.

Technology Needs

As a part of this course, you will be required to program in Python. Though the Lab Sections take place in specially designed classrooms that come equipped with computers, your homework and quizzes may cover Python-related questions, which means we expect you to have access to a laptop capable of connecting to the internet. If you do not currently possess such a laptop, please check out UCSB’s Basic Needs Resource page on Technology Resources to try and acquire one.

Section Switching

As mentioned above, Sections (both Discussion and Lab) take place in special “Collaborate Classrooms” which are equipped with laptops. There are a fixed number of seats and laptops in these classrooms, meaning we cannot under any circumstance over-enroll sections. Therefore, if you want to switch section unofficially (we do not have the ability to switch your official enrollment through GOLD), please follow the steps at this link. Any requests to switch sections that do not adhere to the guidelines posted at that link will be ignored.

Email Policy

Please note that we have a no-email (including Canvas messages) policy, except for emergencies; I leave it to you to decide what’s an ‘emergency’. Please bring all of your questions to me during Office Hours or at the end of lectures. (Also, please note that DMs [Direct Messages] sent to the course staff through Discord will be ignored.) Thank you!

Disclaimer

The instructor reserves the right to modify this syllabus if he deems such modifications academically advisable. Such modifications, should they occur, will be announced publicly.