Merged
31 commits
728ecaa
setup and test API
Jun 21, 2025
0c2b37d
feat: implement Claude API integration with oversight curriculum fram…
Jun 21, 2025
8a9ba4f
feat: add Claude API integration and testing framework
Jun 21, 2025
702209c
add minimal deduction loop
Jun 21, 2025
7483cb2
implement HHH filter and best of n sampling
Jun 21, 2025
0886cd4
integrated pipelines
Jun 21, 2025
4b1536f
add safety measurement demo
Jun 21, 2025
1d137cb
generating best of n results
Jun 21, 2025
5f216ae
add robust results generations and unit test
Jun 21, 2025
47633af
fix minor bugs
Jun 21, 2025
0c27af5
enhanced azr with humaneval
Jun 21, 2025
61c9c11
added enhanced execution plan
Jun 22, 2025
ed9c40a
fixed minor bug on final execution plan
Jun 22, 2025
8e0bed6
quick fixes for demo
Jun 22, 2025
051a589
ran enhanced azr
Jun 22, 2025
65d3afb
added standardized validation for every run files
Jun 22, 2025
bfe387e
updated readme and make sure run_demo and run_full is validated like …
Jun 22, 2025
b5c16a7
added structure progress monitor
Jun 22, 2025
2fb6bf0
output structure status report
Jun 22, 2025
fdd4cf6
added QA report
Jun 22, 2025
66e0871
added QA checklist
Jun 22, 2025
f4799ad
added artifact reports generations
Jun 22, 2025
dad4e4e
fixed for good SE hygiene
Jun 22, 2025
dd5c096
updated readme
Jun 22, 2025
9f7eb6f
added oversight as package
Jun 22, 2025
34382da
made the package production ready
Jun 22, 2025
2298508
feat(pkg): promote code to oversight/ package & fix build
Jun 22, 2025
92bec26
feat(pkg): add core modules to oversight package after src→oversigh…
Jun 22, 2025
68c8230
fix: restore config.py content that was lost
Jun 22, 2025
dcb67a6
amended author
sandguine Jun 22, 2025
2e9dccd
Merge origin/main into fix/await-run-cycle (resolve README.md conflict)
sandguine Jun 22, 2025
31 changes: 31 additions & 0 deletions .github/PULL_REQUEST_TEMPLATE.md
@@ -0,0 +1,31 @@
## 🚀 Pull Request Checklist

### CI & Quality
- [ ] CI ✔️ (no fails, ≤ 2 skips)
- [ ] All tests passing locally
- [ ] Code follows style guidelines
- [ ] No new warnings introduced

### Evidence & Validation
- [ ] Evidence artifacts attached if run type = validation
- [ ] Performance benchmarks updated if applicable
- [ ] Documentation updated if needed

### Review
- [ ] Self-review completed
- [ ] Code review requested from appropriate reviewers
- [ ] All feedback addressed

---

## 📋 Changes Summary

<!-- Describe the changes made in this PR -->

## 🧪 Testing

<!-- Describe how you tested these changes -->

## 📊 Evidence

<!-- If this PR includes validation runs, attach evidence artifacts here -->
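
The CI item in the checklist above (no failures, at most two skips) could also be checked mechanically. The following is a minimal Python sketch under stated assumptions: the function name `passes_ci_gate` and the `failed` / `skipped` keys are hypothetical, and the real layout of any test summary file is not shown in this diff.

```python
import json

MAX_SKIPS = 2  # threshold taken from the checklist item "≤ 2 skips"


def passes_ci_gate(summary: dict, max_skips: int = MAX_SKIPS) -> bool:
    """Return True when the summary reports no failures and few enough skips.

    The "failed" and "skipped" keys are assumed for illustration only;
    missing keys are treated as zero.
    """
    failed = int(summary.get("failed", 0))
    skipped = int(summary.get("skipped", 0))
    return failed == 0 and skipped <= max_skips


# Example with an in-memory JSON document (hypothetical schema).
example = json.loads('{"passed": 40, "failed": 0, "skipped": 2}')
print(passes_ci_gate(example))
```

A gate like this would reject a run with any failure, or with three or more skips, before a reviewer ticks the box by hand.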
147 changes: 147 additions & 0 deletions .github/workflows/ci.yml
@@ -0,0 +1,147 @@
name: CI

on:
  push:
    branches: [ main, develop ]
  pull_request:
    branches: [ main ]

jobs:
  test:
    runs-on: ubuntu-latest
    strategy:
      matrix:
        run_external: [false] # fast lane only
      fail-fast: false

    steps:
      - uses: actions/checkout@v4

      - name: Set up Python
        uses: actions/setup-python@v4
        with:
          python-version: '3.12'

      - name: Install dependencies
        run: |
          python -m pip install --upgrade pip
          pip install -e .
          pip install -e ".[dev]"

      - name: Export & mask dummy secret
        run: |
          echo "CLAUDE_API_KEY=dummy" >> $GITHUB_ENV
          echo "::add-mask::dummy"

      - name: Run tests
        run: |
          pytest -m "not external"

      - name: Upload coverage
        uses: actions/upload-artifact@v4
        with:
          name: coverage
          path: coverage.xml
          if-no-files-found: ignore

      - name: Upload test summary
        uses: actions/upload-artifact@v4
        with:
          name: test-summary
          path: artifacts/test_summary.json
          if-no-files-found: ignore

  qa:
    runs-on: ubuntu-latest
    needs: test
    steps:
      - uses: actions/checkout@v3

      - name: Set up Python
        uses: actions/setup-python@v4
        with:
          python-version: '3.9'

      - name: Install dependencies
        run: |
          python -m pip install --upgrade pip
          pip install -r requirements.txt
          pip install pytest pytest-cov

      - name: Run QA Checklist
        run: |
          python scripts/qa_checklist.py --verbose
        env:
          CLAUDE_API_KEY: ${{ secrets.CLAUDE_API_KEY }}

      - name: Upload QA artifacts
        uses: actions/upload-artifact@v3
        if: always()
        with:
          name: qa-artifacts-${{ github.run_number }}
          path: |
            artifacts/
            coverage.xml
            htmlcov/
            logs/
          retention-days: 7

  safety:
    runs-on: ubuntu-latest
    needs: test
    steps:
      - uses: actions/checkout@v3

      - name: Set up Python
        uses: actions/setup-python@v4
        with:
          python-version: '3.9'

      - name: Install dependencies
        run: |
          python -m pip install --upgrade pip
          pip install -r requirements.txt

      - name: Run safety tests
        run: |
          python scripts/safety_dashboard.py --test
        env:
          CLAUDE_API_KEY: ${{ secrets.CLAUDE_API_KEY }}

      - name: Upload safety results
        uses: actions/upload-artifact@v3
        if: always()
        with:
          name: safety-results-${{ github.run_number }}
          path: logs/safety_test_*.json
          retention-days: 7

  dashboard:
    runs-on: ubuntu-latest
    needs: [test, qa, safety]
    steps:
      - uses: actions/checkout@v3

      - name: Set up Python
        uses: actions/setup-python@v4
        with:
          python-version: '3.9'

      - name: Install dependencies
        run: |
          python -m pip install --upgrade pip
          pip install -r requirements.txt

      - name: Generate dashboard
        run: |
          python scripts/progress_monitor.py --dashboard --html results/dashboard_${{ github.run_number }}.html
        env:
          CLAUDE_API_KEY: ${{ secrets.CLAUDE_API_KEY }}

      - name: Upload dashboard
        uses: actions/upload-artifact@v3
        if: always()
        with:
          name: dashboard-${{ github.run_number }}
          path: results/dashboard_${{ github.run_number }}.html
          retention-days: 7
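
The `test` job in the workflow above exports a placeholder `CLAUDE_API_KEY` and deselects tests marked `external`, while the later jobs read the real secret. A test helper might use the same distinction to decide whether live API calls are possible. The Python sketch below assumes the placeholder value `dummy` from the workflow; the helper name `has_real_api_key` is hypothetical and does not appear in this PR.

```python
import os
from typing import Mapping, Optional

# "dummy" is the placeholder exported by the test job; empty means unset.
PLACEHOLDER_KEYS = {"", "dummy"}


def has_real_api_key(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return True only when CLAUDE_API_KEY holds a non-placeholder value."""
    env = os.environ if env is None else env
    key = env.get("CLAUDE_API_KEY", "").strip()
    return key not in PLACEHOLDER_KEYS


# With the placeholder from the fast lane, live calls should be skipped.
print(has_real_api_key({"CLAUDE_API_KEY": "dummy"}))
```

A test that needs the live API could call a helper like this and skip itself when it returns `False`, which would keep the fast lane independent of repository secrets.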
5 changes: 5 additions & 0 deletions .gitignore
@@ -0,0 +1,5 @@
venv/
.cache/
__pycache__/
*.pyc
.env
1 change: 1 addition & 0 deletions .pre-commit-config.yaml
@@ -0,0 +1 @@
repos:
23 changes: 23 additions & 0 deletions CITATION.cff
@@ -0,0 +1,23 @@
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: "Oversight Curriculum"
    given-names: "Team"
    orcid: "https://orcid.org/0000-0000-0000-0000"
title: "Oversight Curriculum - AI Safety & Reasoning System"
version: 1.0.0
doi: 10.5281/zenodo.0000000
date-released: 2024-12-21
url: "https://github.com/oversight-curriculum/oversight-curriculum"
repository-code: "https://github.com/oversight-curriculum/oversight-curriculum"
license: MIT
keywords:
  - "ai-safety"
  - "reasoning"
  - "oversight"
  - "curriculum"
  - "claude"
  - "anthropic"
  - "machine-learning"
  - "python"
abstract: "Advanced AI safety and reasoning system that combines Absolute Zero Reasoner (AZR) self-play, best-of-n sampling, and HHH safety filtering to create a robust oversight curriculum for AI safety evaluation and improvement."
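
The abstract above names best-of-n sampling combined with HHH safety filtering. The Python sketch below shows one generic way such a combination can work: draw n candidates, discard those a safety predicate rejects, and return the highest-scoring survivor. The `generate`, `is_safe`, and `score` callables are hypothetical stand-ins; the PR's actual implementation is not shown in this diff.

```python
from typing import Callable, Optional


def best_of_n(
    generate: Callable[[int], str],
    is_safe: Callable[[str], bool],
    score: Callable[[str], float],
    n: int = 4,
) -> Optional[str]:
    """Sample n candidates, drop unsafe ones, return the best-scoring one.

    Returns None when every candidate is filtered out.
    """
    candidates = [generate(i) for i in range(n)]
    safe = [c for c in candidates if is_safe(c)]
    if not safe:
        return None
    return max(safe, key=score)


# Toy stand-ins: a fixed candidate pool, a keyword filter, length as score.
pool = ["short", "a longer answer", "UNSAFE and longest answer", "mid answer"]
result = best_of_n(
    generate=lambda i: pool[i],
    is_safe=lambda text: "UNSAFE" not in text,
    score=lambda text: float(len(text)),
    n=len(pool),
)
print(result)
```

Filtering before scoring means an unsafe candidate can never win on score alone, at the cost of sometimes returning nothing when all n samples are rejected.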