Skip to content

Commit

Permalink
Final edits before sharing
Browse files Browse the repository at this point in the history
  • Loading branch information
rafalab committed Sep 3, 2024
1 parent 44920c6 commit 5b21916
Show file tree
Hide file tree
Showing 6 changed files with 58 additions and 122 deletions.
6 changes: 1 addition & 5 deletions docs/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -139,7 +139,6 @@ <h2 id="toc-title">Table of contents</h2>
<ul class="collapse">
<li><a href="#instructors" id="toc-instructors" class="nav-link" data-scroll-target="#instructors">Instructors</a></li>
</ul></li>
<li><a href="#section" id="toc-section" class="nav-link" data-scroll-target="#section"></a></li>
</ul>
<div class="toc-actions"><div><i class="bi bi-github"></i></div><div class="action-links"><p><a href="https://github.com/datasciencelabs/2024/blob/main/index.qmd" class="toc-action">View source</a></p><p><a href="https://github.com/datasciencelabs/2024/issues/new" class="toc-action">Report an issue</a></p></div></div></nav>
</div>
Expand Down Expand Up @@ -198,12 +197,9 @@ <h2 class="anchored" data-anchor-id="instructors">Instructors</h2>
<li>Nikhil Vytla</li>
<li>Yuan Wang</li>
</ul>
</section>
</section>
<section id="section" class="level1">
<h1></h1>


</section>
</section>

</main> <!-- /main -->
Expand Down
6 changes: 3 additions & 3 deletions docs/search.json
Original file line number Diff line number Diff line change
Expand Up @@ -53,7 +53,7 @@
"href": "syllabus.html#course-description",
"title": "Syllabus",
"section": "Course Description",
"text": "Course Description\nThis course introduces the following:\n\nUNIX/Linux shell.\nReproducible document preparation with RStudio, knitr, and markdown.\n\nVersion control with git and GitHub.\nR programming,\nData wrangling with dplyr and data.table.\nData visualization with ggplot2.\n\nWe also demonstrate how the following concepts are applied in data analysis:\n\nMonte Carlo simulations.\nStatistical modeling.\nHigh-dimensional data techniques, and\nMachine learning.\n\nWe do not cover the theory and details of these methods as they are covered in other courses.\nThroughout the course, we use motivating case studies and data analysis problem sets based on challenges similar to those you encounter in scientific research."
"text": "Course Description\nThis course introduces the following:\n\nUNIX/Linux shell.\nReproducible document preparation with RStudio, knitr, and markdown\nVersion control with git and GitHub\nR programming\nData wrangling with dplyr and data.table\nData visualization with ggplot2\n\nWe also demonstrate how the following concepts are applied in data analysis:\n\nMonte Carlo simulations\nStatistical modeling\nHigh-dimensional data techniques\nMachine learning\n\nWe do not cover the theory and details of these methods as they are covered in other courses.\nThroughout the course, we use motivating case studies and data analysis problem sets based on challenges similar to those you encounter in scientific research."
},
{
"objectID": "syllabus.html#weekly-course-structure",
Expand Down Expand Up @@ -102,13 +102,13 @@
"href": "syllabus.html#key-dates---subject-to-change-after-first-week-of-class",
"title": "Syllabus",
"section": "Key Dates - Subject to Change After First Week of Class",
"text": "Key Dates - Subject to Change After First Week of Class\n\n\n\nDate\nEvent\n\n\n\n\nSep 10\nProblem Set 1 due\n\n\nSep 13\nProblem Set 2 due\n\n\nSep 19\nProblem Set 3 due\n\n\nSep 26\nProblem Set 4 due\n\n\nSep 26\nProblem Set 5 due\n\n\nOct 11\nProblem Set 6 due\n\n\nOct 14\nNo class: Indigenous Peoples Day\n\n\nOct 16\nMidterm 1\n\n\nOct 23\nStart final project, obtain approval for personal project.\n\n\nNov 01\nProblem Set 7 due\n\n\nNov 11\nNo class: Veterans’ Day\n\n\nNov 01\nProblem Set 7 due\n\n\nNov 22\nProblem Set 8 due\n\n\nNov 25\nMidterm 2\n\n\nNov 27\nNo class: Thanksgiving Recess Begins\n\n\nDec 06\nProblem Set 9 due\n\n\nDec 13\nProblem Set 10 due\n\n\nDec 20\nFinal Project due"
"text": "Key Dates - Subject to Change After First Week of Class\n\n\n\n\n\n\n\nDate\nEvent\n\n\n\n\nOct 14\nNo class: Indigenous Peoples Day\n\n\nOct 16\nMidterm 1\n\n\nOct 23\nStart final project. Obtain approval if you want to do personal project instead.\n\n\nNov 11\nNo class: Veterans’ Day\n\n\nNov 25\nMidterm 2\n\n\nNov 27\nNo class: Thanksgiving Recess Begins\n\n\nDec 20\nFinal Project due"
},
{
"objectID": "syllabus.html#preliminary-schedule",
"href": "syllabus.html#preliminary-schedule",
"title": "Syllabus",
"section": "Preliminary Schedule",
"text": "Preliminary Schedule\n\n\n\nDates\nTopic\nLinks to readings and notes\n\n\n\n\nSep 04\nProductivity Tools\nGetting Started, Installing R and RStudio, Unix\n\n\nSep 09\nProductivity Tools\nRStudio Projects, Quarto Git and GitHub\n\n\nSep 10\nProblem Set 1 due\nDifficulty: easy\n\n\nSep 11\nR\nR Basics, Vectorization\n\n\nSep 13\nProblem Set 2 due\nDifficulty: easy\n\n\nSep 16, Sep 18\nR\ndplyr, dates and times, ggplot2\n\n\nSep 19\nProblem Set 3 due\nDifficulty: easy\n\n\nSep 23, Sep 25\nWrangling\nImporting data Locales Reshaping Data, Joining Tables, Extracting data from the web\n\n\nSep 26\nProblem Set 4 due\nDifficulty: medium\n\n\nSep 30, Oct 02\nData visualization\nDistributions, Dataviz Principles\n\n\nSep 26\nProblem Set 5 due\nDifficulty: medium\n\n\nOct 07, Oct 09\nProbability\nMonte Carlo, Random Variables & CLT\n\n\nOct 11\nProblem Set 6 due\nDifficulty: easy\n\n\nOct 14\nNo class\nIndigenous Peoples Day\n\n\nOct 16\nMidterm 1\nCover material from Sep 04-Oct 09\n\n\nOct 21, Oct 23\nInference\nParameters & Estimates, Confidence Intervals\n\n\nOct 28, Oct 30\nStatistical Models\nData-driven Models, Bayesian Statistics, Hierarchical Models\n\n\nNov 01\nProblem Set 7 due\nDifficulty: hard\n\n\nNov 04, Nov 06\nLinear models\nRegression, Multivariate Regression\n\n\nNov 11\nNo class\nVeterans’ Day\n\n\nNov 13\nLinear models\nMeasurement Error Models, Treatment Effect Models, Association Tests, Association Not Causation\n\n\nNov 01\nProblem Set 7 due\nDifficulty: hard\n\n\nNov 18, Nov 20\nHigh dimensional data\nMatrices in R, Applied Linear Algebra, Dimension Reduction\n\n\nNov 22\nProblem Set 8 due\nDifficulty: easy\n\n\nNov 25\nMidterm 2\nCover material from Sep 04-Nov 20\n\n\nNov 27\nNo class\nThanksgiving Recess Begins\n\n\nDec 02, Dec 04\nMachine Learning\nNotation and terminology, [Evaluation Metrics\n\n\nDec 06\nProblem Set 9 due\nDifficulty: easy\n\n\nDec 09, Dec 11\nMachine Learning\nResampling methods, ML algorithms, ML in practice\n\n\nDec 13\nProblem Set 10 due\nDifficulty: hard\n\n\nDec 16, Dec 18\nOther topics\n\n\n\nDec 20\nFinal Project due"
"text": "Preliminary Schedule\n\n\n\nDates\nTopic\nLinks to readings and notes\n\n\n\n\nSep 04\nProductivity Tools\nInstalling R and RStudio on Windows or Mac, Getting Started Unix\n\n\nSep 09\nProductivity Tools\nRStudio Projects, Quarto Git and GitHub\n\n\nSep 10\nProblem Set 1 due\nDifficulty: easy\n\n\nSep 11\nR\nR Basics, Vectorization\n\n\nSep 13\nProblem Set 2 due\nDifficulty: easy\n\n\nSep 16, Sep 18\nR\ndplyr, dates and times, ggplot2\n\n\nSep 19\nProblem Set 3 due\nDifficulty: easy\n\n\nSep 23, Sep 25\nWrangling\nImporting data Locales Reshaping Data, Joining Tables, Extracting data from the web\n\n\nSep 26\nProblem Set 4 due\nDifficulty: medium\n\n\nSep 30, Oct 02\nData visualization\nDistributions, Dataviz Principles\n\n\nOct 04\nProblem Set 5 due\nDifficulty: medium\n\n\nOct 07, Oct 09\nProbability\nMonte Carlo, Random Variables & CLT\n\n\nOct 11\nProblem Set 6 due\nDifficulty: easy\n\n\nOct 14\nNo class\nIndigenous Peoples Day\n\n\nOct 16\nMidterm 1\nCovers material from Sep 04-Oct 11\n\n\nOct 21, Oct 23\nInference\nParameters & Estimates, Confidence Intervals\n\n\nOct 28, Oct 30\nStatistical Models\nData-driven Models, Bayesian Statistics, Hierarchical Models\n\n\nNov 01\nProblem Set 7 due\nDifficulty: hard\n\n\nNov 04, Nov 06\nLinear models\nRegression, Multivariate Regression\n\n\nNov 11\nNo class\nVeterans’ Day\n\n\nNov 13\nLinear models\nMeasurement Error Models, Treatment Effect Models, Association Tests, Association Not Causation\n\n\nNov 15\nProblem Set 8 due\nDifficulty: hard\n\n\nNov 18, Nov 20\nHigh dimensional data\nMatrices in R, Applied Linear Algebra, Dimension Reduction\n\n\nNov 22\nProblem Set 9 due\nDifficulty: easy\n\n\nNov 25\nMidterm 2\nCovers material from Sep 04-Nov 22\n\n\nNov 27\nNo class\nThanksgiving Recess Begins\n\n\nDec 02, Dec 04\nMachine Learning\nNotation and terminology, Evaluation Metrics, conditional probabilities, smoothing\n\n\nDec 09, Dec 11\nMachine Learning\nResampling methods, ML algorithms, ML in practice\n\n\nDec 13\nProblem Set 10 due\nDifficulty: hard\n\n\nDec 16, Dec 18\nOther topics\n\n\n\nDec 20\nFinal Project due"
}
]
6 changes: 3 additions & 3 deletions docs/sitemap.xml
Original file line number Diff line number Diff line change
Expand Up @@ -2,14 +2,14 @@
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
<url>
<loc>http://datasciencelabs.github.io/2024/index.html</loc>
<lastmod>2024-09-02T20:28:45.364Z</lastmod>
<lastmod>2024-09-03T02:25:22.113Z</lastmod>
</url>
<url>
<loc>http://datasciencelabs.github.io/2024/downloading-course-materials.html</loc>
<lastmod>2024-09-02T20:28:45.367Z</lastmod>
<lastmod>2024-09-03T02:25:22.116Z</lastmod>
</url>
<url>
<loc>http://datasciencelabs.github.io/2024/syllabus.html</loc>
<lastmod>2024-09-02T20:28:45.373Z</lastmod>
<lastmod>2024-09-03T02:31:49.342Z</lastmod>
</url>
</urlset>
106 changes: 30 additions & 76 deletions docs/syllabus.html
Original file line number Diff line number Diff line change
Expand Up @@ -171,10 +171,10 @@ <h1 class="title">Syllabus</h1>
<section id="course-information" class="level2">
<h2 class="anchored" data-anchor-id="course-information">Course Information</h2>
<ul>
<li><strong>BST 260 Introduction to Data Science</strong></li>
<li><strong>Kresge 202A and 202B (HSPH)</strong></li>
<li><strong>Monday 09:45 AM - 11:15 AM; Wednesday 09:45 AM - 11:15 AM</strong></li>
<li><strong>Lecture notes: <a href="https://datasciencelabs.github.io/2024/">https://datasciencelabs.github.io/2024/</a></strong></li>
<li>BST 260 Introduction to Data Science</li>
<li>Kresge 202A and 202B (HSPH)</li>
<li>Monday 09:45 AM - 11:15 AM; Wednesday 09:45 AM - 11:15 AM</li>
<li>Lecture notes: <a href="https://datasciencelabs.github.io/2024/">https://datasciencelabs.github.io/2024/</a></li>
</ul>
</section>
<section id="prerequisites" class="level2">
Expand All @@ -194,19 +194,18 @@ <h2 class="anchored" data-anchor-id="course-description">Course Description</h2>
<p>This course introduces the following:</p>
<ul>
<li>UNIX/Linux shell.</li>
<li>Reproducible document preparation with RStudio, knitr, and markdown.<br>
</li>
<li>Version control with git and GitHub.</li>
<li>R programming,</li>
<li>Data wrangling with dplyr and data.table.</li>
<li>Data visualization with ggplot2.</li>
<li>Reproducible document preparation with RStudio, knitr, and markdown</li>
<li>Version control with git and GitHub</li>
<li>R programming</li>
<li>Data wrangling with dplyr and data.table</li>
<li>Data visualization with ggplot2</li>
</ul>
<p>We also demonstrate how the following concepts are applied in data analysis:</p>
<ul>
<li>Monte Carlo simulations.</li>
<li>Statistical modeling.</li>
<li>High-dimensional data techniques, and</li>
<li>Machine learning.</li>
<li>Monte Carlo simulations</li>
<li>Statistical modeling</li>
<li>High-dimensional data techniques</li>
<li>Machine learning</li>
</ul>
<p>We do not cover the theory and details of these methods as they are covered in other courses.</p>
<p>Throughout the course, <strong>we use motivating case studies and data analysis problem sets based on challenges similar to those you encounter in scientific research</strong>.</p>
Expand Down Expand Up @@ -282,6 +281,10 @@ <h2 class="anchored" data-anchor-id="chatgpt-policy">ChatGPT Policy</h2>
<section id="key-dates---subject-to-change-after-first-week-of-class" class="level2">
<h2 class="anchored" data-anchor-id="key-dates---subject-to-change-after-first-week-of-class">Key Dates - Subject to Change After First Week of Class</h2>
<table class="table">
<colgroup>
<col style="width: 46%">
<col style="width: 53%">
</colgroup>
<thead>
<tr class="header">
<th>Date</th>
Expand All @@ -290,30 +293,6 @@ <h2 class="anchored" data-anchor-id="key-dates---subject-to-change-after-first-w
</thead>
<tbody>
<tr class="odd">
<td>Sep 10</td>
<td>Problem Set 1 due</td>
</tr>
<tr class="even">
<td>Sep 13</td>
<td>Problem Set 2 due</td>
</tr>
<tr class="odd">
<td>Sep 19</td>
<td>Problem Set 3 due</td>
</tr>
<tr class="even">
<td>Sep 26</td>
<td>Problem Set 4 due</td>
</tr>
<tr class="odd">
<td>Sep 26</td>
<td>Problem Set 5 due</td>
</tr>
<tr class="even">
<td>Oct 11</td>
<td>Problem Set 6 due</td>
</tr>
<tr class="odd">
<td>Oct 14</td>
<td>No class: Indigenous Peoples Day</td>
</tr>
Expand All @@ -323,41 +302,21 @@ <h2 class="anchored" data-anchor-id="key-dates---subject-to-change-after-first-w
</tr>
<tr class="odd">
<td>Oct 23</td>
<td>Start final project, obtain approval for personal project.</td>
<td>Start final project. Obtain approval if you want to do personal project instead.</td>
</tr>
<tr class="even">
<td>Nov 01</td>
<td>Problem Set 7 due</td>
</tr>
<tr class="odd">
<td>Nov 11</td>
<td>No class: Veterans’ Day</td>
</tr>
<tr class="even">
<td>Nov 01</td>
<td>Problem Set 7 due</td>
</tr>
<tr class="odd">
<td>Nov 22</td>
<td>Problem Set 8 due</td>
</tr>
<tr class="even">
<td>Nov 25</td>
<td>Midterm 2</td>
</tr>
<tr class="odd">
<tr class="even">
<td>Nov 27</td>
<td>No class: Thanksgiving Recess Begins</td>
</tr>
<tr class="even">
<td>Dec 06</td>
<td>Problem Set 9 due</td>
</tr>
<tr class="odd">
<td>Dec 13</td>
<td>Problem Set 10 due</td>
</tr>
<tr class="even">
<td>Dec 20</td>
<td>Final Project due</td>
</tr>
Expand All @@ -378,7 +337,7 @@ <h2 class="anchored" data-anchor-id="preliminary-schedule">Preliminary Schedule<
<tr class="odd">
<td style="text-align: left;">Sep 04</td>
<td style="text-align: left;">Productivity Tools</td>
<td style="text-align: left;"><a href="http://rafalab.dfci.harvard.edu/dsbook-part-1/R/getting-started.html">Getting Started</a>, <a href="http://rafalab.dfci.harvard.edu/dsbook-part-1/R/installing-r-and-rstudio.html">Installing R and RStudio</a>, <a href="http://rafalab.dfci.harvard.edu/dsbook-part-1/productivity/unix.html">Unix</a></td>
<td style="text-align: left;">Installing R and RStudio on <a href="https://teacherscollege.screenstepslive.com/a/1108074-install-r-and-rstudio-for-windows">Windows</a> or <a href="https://teacherscollege.screenstepslive.com/a/1135059-install-r-and-rstudio-for-mac">Mac</a>, <a href="http://rafalab.dfci.harvard.edu/dsbook-part-1/R/getting-started.html">Getting Started</a> <a href="http://rafalab.dfci.harvard.edu/dsbook-part-1/productivity/unix.html">Unix</a></td>
</tr>
<tr class="even">
<td style="text-align: left;">Sep 09</td>
Expand Down Expand Up @@ -426,7 +385,7 @@ <h2 class="anchored" data-anchor-id="preliminary-schedule">Preliminary Schedule<
<td style="text-align: left;"><a href="http://rafalab.dfci.harvard.edu/dsbook-part-1/dataviz/distributions.html">Distributions</a>, <a href="http://rafalab.dfci.harvard.edu/dsbook-part-1/dataviz/dataviz-principles.html">Dataviz Principles</a></td>
</tr>
<tr class="odd">
<td style="text-align: left;">Sep 26</td>
<td style="text-align: left;">Oct 04</td>
<td style="text-align: left;"><strong>Problem Set 5 due</strong></td>
<td style="text-align: left;">Difficulty: medium</td>
</tr>
Expand All @@ -448,7 +407,7 @@ <h2 class="anchored" data-anchor-id="preliminary-schedule">Preliminary Schedule<
<tr class="odd">
<td style="text-align: left;">Oct 16</td>
<td style="text-align: left;"><strong>Midterm 1</strong></td>
<td style="text-align: left;">Cover material from Sep 04-Oct 09</td>
<td style="text-align: left;">Covers material from Sep 04-Oct 11</td>
</tr>
<tr class="even">
<td style="text-align: left;">Oct 21, Oct 23</td>
Expand Down Expand Up @@ -481,8 +440,8 @@ <h2 class="anchored" data-anchor-id="preliminary-schedule">Preliminary Schedule<
<td style="text-align: left;"><a href="http://rafalab.dfci.harvard.edu/dsbook-part-2/linear-models/measurement-error-models.html">Measurement Error Models</a>, <a href="http://rafalab.dfci.harvard.edu/dsbook-part-2/linear-models/treatment-effect-models.html">Treatment Effect Models</a>, <a href="http://rafalab.dfci.harvard.edu/dsbook-part-2/linear-models/association-tests.html">Association Tests</a>, <a href="http://rafalab.dfci.harvard.edu/dsbook-part-2/linear-models/association-not-causation.html">Association Not Causation</a></td>
</tr>
<tr class="even">
<td style="text-align: left;">Nov 01</td>
<td style="text-align: left;"><strong>Problem Set 7 due</strong></td>
<td style="text-align: left;">Nov 15</td>
<td style="text-align: left;"><strong>Problem Set 8 due</strong></td>
<td style="text-align: left;">Difficulty: hard</td>
</tr>
<tr class="odd">
Expand All @@ -492,13 +451,13 @@ <h2 class="anchored" data-anchor-id="preliminary-schedule">Preliminary Schedule<
</tr>
<tr class="even">
<td style="text-align: left;">Nov 22</td>
<td style="text-align: left;"><strong>Problem Set 8 due</strong></td>
<td style="text-align: left;"><strong>Problem Set 9 due</strong></td>
<td style="text-align: left;">Difficulty: easy</td>
</tr>
<tr class="odd">
<td style="text-align: left;">Nov 25</td>
<td style="text-align: left;"><strong>Midterm 2</strong></td>
<td style="text-align: left;">Cover material from Sep 04-Nov 20</td>
<td style="text-align: left;">Covers material from Sep 04-Nov 22</td>
</tr>
<tr class="even">
<td style="text-align: left;">Nov 27</td>
Expand All @@ -508,29 +467,24 @@ <h2 class="anchored" data-anchor-id="preliminary-schedule">Preliminary Schedule<
<tr class="odd">
<td style="text-align: left;">Dec 02, Dec 04</td>
<td style="text-align: left;">Machine Learning</td>
<td style="text-align: left;"><a href="https://rafalab.dfci.harvard.edu/dsbook-part-2/ml/notation-and-terminology.html">Notation and terminology</a>, [Evaluation Metrics</td>
<td style="text-align: left;"><a href="https://rafalab.dfci.harvard.edu/dsbook-part-2/ml/notation-and-terminology.html">Notation and terminology</a>, <a href="https://rafalab.dfci.harvard.edu/dsbook-part-2/ml/evaluation-metrics.html">Evaluation Metrics</a>, <a href="https://rafalab.dfci.harvard.edu/dsbook-part-2/ml/conditionals.html">conditional probabilities</a>, <a href="https://rafalab.dfci.harvard.edu/dsbook-part-2/ml/smoothing.html">smoothing</a></td>
</tr>
<tr class="even">
<td style="text-align: left;">Dec 06</td>
<td style="text-align: left;"><strong>Problem Set 9 due</strong></td>
<td style="text-align: left;">Difficulty: easy</td>
</tr>
<tr class="odd">
<td style="text-align: left;">Dec 09, Dec 11</td>
<td style="text-align: left;">Machine Learning</td>
<td style="text-align: left;"><a href="https://rafalab.dfci.harvard.edu/dsbook-part-2/ml/resampling-methods.html">Resampling methods</a>, <a href="https://rafalab.dfci.harvard.edu/dsbook-part-2/ml/algorithms.html">ML algorithms</a>, <a href="https://rafalab.dfci.harvard.edu/dsbook-part-2/ml/ml-in-practice.html">ML in practice</a></td>
</tr>
<tr class="even">
<tr class="odd">
<td style="text-align: left;">Dec 13</td>
<td style="text-align: left;"><strong>Problem Set 10 due</strong></td>
<td style="text-align: left;">Difficulty: hard</td>
</tr>
<tr class="odd">
<tr class="even">
<td style="text-align: left;">Dec 16, Dec 18</td>
<td style="text-align: left;">Other topics</td>
<td style="text-align: left;"></td>
</tr>
<tr class="even">
<tr class="odd">
<td style="text-align: left;">Dec 20</td>
<td style="text-align: left;"><strong>Final Project due</strong></td>
<td style="text-align: left;"></td>
Expand Down
2 changes: 0 additions & 2 deletions index.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -20,5 +20,3 @@



#

Loading

0 comments on commit 5b21916

Please sign in to comment.