Skip to content

Commit

Permalink
remove www
Browse files Browse the repository at this point in the history
  • Loading branch information
natolambert committed Aug 12, 2024
1 parent 3a77bbf commit 16181fb
Show file tree
Hide file tree
Showing 2 changed files with 34 additions and 34 deletions.
34 changes: 17 additions & 17 deletions templates/chapter.html
Original file line number Diff line number Diff line change
Expand Up @@ -46,7 +46,7 @@ <h1 class="title">$title$</h1>
<nav class="chapter-nav">
<div class="section">
<ul>
<li><a href="https://www.rlhfbook.com">Home</a></li>
<li><a href="https://rlhfbook.com">Home</a></li>
<li><a href="https://github.com/natolambert/rlhf-book">GitHub Repository</a></li>
<li>PDF (Soon)</li>
<li>Order a copy (Soon)</li>
Expand All @@ -56,46 +56,46 @@ <h1 class="title">$title$</h1>
<div class="section">
<p><strong>Introductions</strong></p>
<ol>
<li><a href="https://www.rlhfbook.com/c/01-1-introduction.html">Introduction</a></li>
<li><a href="https://www.rlhfbook.com/c/01-2-preferences.html">What are preferences?</a></li>
<li><a href="https://www.rlhfbook.com/c/01-3-optimization.html">Optimization and RL</a></li>
<li><a href="https://www.rlhfbook.com/c/01-4-related-works.html">Seminal (Recent) Works</a></li>
<li><a href="https://rlhfbook.com/c/01-1-introduction.html">Introduction</a></li>
<li><a href="https://rlhfbook.com/c/01-2-preferences.html">What are preferences?</a></li>
<li><a href="https://rlhfbook.com/c/01-3-optimization.html">Optimization and RL</a></li>
<li><a href="https://rlhfbook.com/c/01-4-related-works.html">Seminal (Recent) Works</a></li>
</ol>
</div>

<div class="section">
<p><strong>Problem Setup</strong></p>
<ol>
<li><a href="https://www.rlhfbook.com/c/02-1-setup.html">Definitions</a></li>
<li><a href="https://www.rlhfbook.com/c/02-2-preference-data.html">Preference Data</a></li>
<li><a href="https://www.rlhfbook.com/c/02-3-reward-models.html">Reward Modeling</a></li>
<li><a href="https://www.rlhfbook.com/c/02-4-regularization.html">Regularization</a></li>
<li><a href="https://rlhfbook.com/c/02-1-setup.html">Definitions</a></li>
<li><a href="https://rlhfbook.com/c/02-2-preference-data.html">Preference Data</a></li>
<li><a href="https://rlhfbook.com/c/02-3-reward-models.html">Reward Modeling</a></li>
<li><a href="https://rlhfbook.com/c/02-4-regularization.html">Regularization</a></li>
</ol>
</div>

<div class="section">
<p><strong>Optimization</strong></p>
<ol>
<li><a href="https://www.rlhfbook.com/c/03-1-instructions.html">Instruction Tuning</a></li>
<li><a href="https://www.rlhfbook.com/c/03-2-rejection-sampling.html">Rejection Sampling</a></li>
<li><a href="https://www.rlhfbook.com/c/03-3-policy-gradients.html">Policy Gradients</a></li>
<li><a href="https://www.rlhfbook.com/c/03-4-direct-alignment.html">Direct Alignment Algorithms</a></li>
<li><a href="https://rlhfbook.com/c/03-1-instructions.html">Instruction Tuning</a></li>
<li><a href="https://rlhfbook.com/c/03-2-rejection-sampling.html">Rejection Sampling</a></li>
<li><a href="https://rlhfbook.com/c/03-3-policy-gradients.html">Policy Gradients</a></li>
<li><a href="https://rlhfbook.com/c/03-4-direct-alignment.html">Direct Alignment Algorithms</a></li>
</ol>
</div>

<div class="section">
<p><strong>Advanced (TBD)</strong></p>
<ol>
<li><a href="https://www.rlhfbook.com/c/04-1-cai.html">Constitutional AI</a></li>
<li><a href="https://www.rlhfbook.com/c/04-2-synthetic.html">Synthetic Data</a></li>
<li><a href="https://www.rlhfbook.com/c/04-3-evaluation.html">Evaluation</a></li>
<li><a href="https://rlhfbook.com/c/04-1-cai.html">Constitutional AI</a></li>
<li><a href="https://rlhfbook.com/c/04-2-synthetic.html">Synthetic Data</a></li>
<li><a href="https://rlhfbook.com/c/04-3-evaluation.html">Evaluation</a></li>
</ol>
</div>

<div class="section">
<p><strong>Open Questions (TBD)</strong></p>
<ol>
<li><a href="https://www.rlhfbook.com/c/05-1-over-optimization.html">Over-optimization</a></li>
<li><a href="https://rlhfbook.com/c/05-1-over-optimization.html">Over-optimization</a></li>
<li>Style</li>
</ol>
</div>
Expand Down
34 changes: 17 additions & 17 deletions templates/html.html
Original file line number Diff line number Diff line change
Expand Up @@ -46,7 +46,7 @@ <h1 class="title">$title$</h1>
<nav class="chapter-nav">
<div class="section">
<ul>
<li><a href="https://www.rlhfbook.com">Home</a></li>
<li><a href="https://rlhfbook.com">Home</a></li>
<li><a href="https://github.com/natolambert/rlhf-book">GitHub Repository</a></li>
<li>PDF (Soon)</li>
<li>Order a copy (Soon)</li>
Expand All @@ -56,46 +56,46 @@ <h1 class="title">$title$</h1>
<div class="section">
<p><strong>Introductions</strong></p>
<ol>
<li><a href="https://www.rlhfbook.com/c/01-1-introduction.html">Introduction</a></li>
<li><a href="https://www.rlhfbook.com/c/01-2-preferences.html">What are preferences?</a></li>
<li><a href="https://www.rlhfbook.com/c/01-3-optimization.html">Optimization and RL</a></li>
<li><a href="https://www.rlhfbook.com/c/01-4-related-works.html">Seminal (Recent) Works</a></li>
<li><a href="https://rlhfbook.com/c/01-1-introduction.html">Introduction</a></li>
<li><a href="https://rlhfbook.com/c/01-2-preferences.html">What are preferences?</a></li>
<li><a href="https://rlhfbook.com/c/01-3-optimization.html">Optimization and RL</a></li>
<li><a href="https://rlhfbook.com/c/01-4-related-works.html">Seminal (Recent) Works</a></li>
</ol>
</div>

<div class="section">
<p><strong>Problem Setup</strong></p>
<ol>
<li><a href="https://www.rlhfbook.com/c/02-1-setup.html">Definitions</a></li>
<li><a href="https://www.rlhfbook.com/c/02-2-preference-data.html">Preference Data</a></li>
<li><a href="https://www.rlhfbook.com/c/02-3-reward-models.html">Reward Modeling</a></li>
<li><a href="https://www.rlhfbook.com/c/02-4-regularization.html">Regularization</a></li>
<li><a href="https://rlhfbook.com/c/02-1-setup.html">Definitions</a></li>
<li><a href="https://rlhfbook.com/c/02-2-preference-data.html">Preference Data</a></li>
<li><a href="https://rlhfbook.com/c/02-3-reward-models.html">Reward Modeling</a></li>
<li><a href="https://rlhfbook.com/c/02-4-regularization.html">Regularization</a></li>
</ol>
</div>

<div class="section">
<p><strong>Optimization</strong></p>
<ol>
<li><a href="https://www.rlhfbook.com/c/03-1-instructions.html">Instruction Tuning</a></li>
<li><a href="https://www.rlhfbook.com/c/03-2-rejection-sampling.html">Rejection Sampling</a></li>
<li><a href="https://www.rlhfbook.com/c/03-3-policy-gradients.html">Policy Gradients</a></li>
<li><a href="https://www.rlhfbook.com/c/03-4-direct-alignment.html">Direct Alignment Algorithms</a></li>
<li><a href="https://rlhfbook.com/c/03-1-instructions.html">Instruction Tuning</a></li>
<li><a href="https://rlhfbook.com/c/03-2-rejection-sampling.html">Rejection Sampling</a></li>
<li><a href="https://rlhfbook.com/c/03-3-policy-gradients.html">Policy Gradients</a></li>
<li><a href="https://rlhfbook.com/c/03-4-direct-alignment.html">Direct Alignment Algorithms</a></li>
</ol>
</div>

<div class="section">
<p><strong>Advanced (TBD)</strong></p>
<ol>
<li><a href="https://www.rlhfbook.com/c/04-1-cai.html">Constitutional AI</a></li>
<li><a href="https://www.rlhfbook.com/c/04-2-synthetic.html">Synthetic Data</a></li>
<li><a href="https://www.rlhfbook.com/c/04-3-evaluation.html">Evaluation</a></li>
<li><a href="https://rlhfbook.com/c/04-1-cai.html">Constitutional AI</a></li>
<li><a href="https://rlhfbook.com/c/04-2-synthetic.html">Synthetic Data</a></li>
<li><a href="https://rlhfbook.com/c/04-3-evaluation.html">Evaluation</a></li>
</ol>
</div>

<div class="section">
<p><strong>Open Questions (TBD)</strong></p>
<ol>
<li><a href="https://www.rlhfbook.com/c/05-1-over-optimization.html">Over-optimization</a></li>
<li><a href="https://rlhfbook.com/c/05-1-over-optimization.html">Over-optimization</a></li>
<li>Style</li>
</ol>
</div>
Expand Down

0 comments on commit 16181fb

Please sign in to comment.