Coding Standard

Scope: All Python code under forgelm/ and tests/. Enforced by: ruff check + ruff format --check in CI (.github/workflows/ci.yml).

Tooling

Configured in pyproject.toml:

[tool.ruff]
target-version = "py310"
line-length = 120

[tool.ruff.lint]
select = ["F", "E9", "W", "B", "I"]
ignore = ["B008", "B905"]

Python 3.10+ only. Support matrix: 3.10, 3.11, 3.12, 3.13 (tested in CI).
Line length: 120. Not 80, not 88.
Lint rules: pyflakes (F), syntax errors (E9), warnings (W), bugbear (B), import order (I).
Format: ruff format (Black-compatible). Run before every commit.

Install dev tools: pip install -e '.[dev]' pulls in ruff>=0.4.0 and pytest>=8.0.0.

Naming

Follow PEP 8 with one clarification:

Kind	Convention	Example
Module	`snake_case.py`	`forgelm/model_card.py`
Class	`PascalCase`	`ForgeConfig`, `AuditLogger`, `WebhookNotifier`
Function / method	`snake_case`	`train_with_auto_revert`, `generate_model_card`
Constant	`UPPER_SNAKE_CASE`	`EXIT_CONFIG_ERROR`, `DEFAULT_TIMEOUT`
Private	`_leading_underscore`	`_setup_logging`, `_send`
Pydantic config class	`XxxConfig`	`ModelConfig`, `LoraConfigModel`, `TrainingConfig`

One-letter variables: allowed only for conventional math/loop locals (i, j, x, y). In domain code prefer descriptive names.

Type hints

Required on public API. Use them on every def that's imported elsewhere or called from CLI.

Use Optional[X], not X | None. Codebase is uniformly Optional (e.g., forgelm/config.py, forgelm/trainer.py). Stay consistent even though both are valid on 3.10+.
Use List[X], Dict[K, V], Tuple[...] from typing — matches existing imports.
Use Literal["a", "b"] for enum-like string fields (seen throughout forgelm/config.py).
Any is acceptable where structure is external (YAML dicts, HF return values) but prefer concrete types where possible.

from typing import Any, Dict, List, Literal, Optional

def generate_data_governance_report(
    config: Any,
    dataset: Dict[str, Any],
) -> Dict[str, Any]:
    ...
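
As a minimal sketch of the conventions above, the helper below combines Optional, List, Dict, and Literal in one public signature. The names `Precision` and `resolve_precision`, and the parameters, are hypothetical and chosen only for illustration; they are not taken from forgelm.

```python
from typing import Dict, List, Literal, Optional

# Hypothetical alias for illustration; not copied from forgelm/config.py.
Precision = Literal["fp16", "bf16", "fp32"]


def resolve_precision(
    requested: Optional[Precision],
    supported: List[Precision],
    overrides: Optional[Dict[str, Precision]] = None,
    device: str = "cpu",
) -> Precision:
    """Pick a precision: per-device override, then the request, then the first supported."""
    if overrides and device in overrides:
        return overrides[device]
    if requested is not None and requested in supported:
        return requested
    return supported[0]
```

Spelling the enum-like values as a Literal lets a type checker reject a misspelt string such as "fp61" at the call site instead of at runtime.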

Docstrings

Google style. One-line summary, blank line, optional sections (Args:, Returns:, Raises:).

Module docstrings are required and should state purpose; for compliance-touching modules, cite the EU AI Act article:

"""EU AI Act compliance, training data provenance, and audit trail generation.

Covers: Article 9 (Risk Management), Article 10 (Data Governance),
Article 11 + Annex IV (Technical Documentation), Article 12 (Record-Keeping),
Article 13 (Transparency), Article 14 (Human Oversight), Article 15 (Accuracy),
and Article 17 (Quality Management System).
"""

Function docstrings are required when:

The function is public (no leading underscore) and non-trivial, or
The behaviour is not obvious from the name and signature.

Keep them terse — if the docstring would just restate the name, skip it.
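
A function docstring in this style might look like the sketch below. `chunk_records` is a hypothetical helper, not a forgelm function; it is here only to show the one-line summary followed by the optional Args:, Returns:, and Raises: sections.

```python
from typing import List


def chunk_records(records: List[str], size: int) -> List[List[str]]:
    """Split records into consecutive chunks of at most ``size`` items.

    Args:
        records: Items to split, in order.
        size: Maximum chunk length. Must be positive.

    Returns:
        A list of chunks; the last one may be shorter than ``size``.

    Raises:
        ValueError: If ``size`` is less than 1.
    """
    if size < 1:
        raise ValueError(f"size must be >= 1, got {size}")
    return [records[i : i + size] for i in range(0, len(records), size)]
```

The docstring earns its place here because the short-last-chunk behaviour and the ValueError are not obvious from the name and signature alone.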

Imports

Managed by ruff (rule I). Order:

Standard library
Third-party
Local (forgelm.*)

Within each group, alphabetical. ruff check --fix applies this automatically via rule I; ruff format alone does not sort imports.

Do not use wildcard imports (from x import *). Do not re-export via __init__.py unless there's a deliberate public API reason.

Pydantic models

Every YAML-backed config section is a BaseModel subclass in forgelm/config.py. Follow:

Field order: required first, then optional with defaults.
Defaults must be safe for the "omitted from YAML" case.
Use field_validator for cross-field checks; prefer model_validator(mode='after') for whole-model invariants.
Do not silently coerce invalid values. Raise ValueError with an actionable message (see error-handling.md).

class TrainingConfig(BaseModel):
    trainer_type: Literal["sft", "dpo", "simpo", "kto", "orpo", "grpo"] = "sft"
    num_train_epochs: float = 1.0
    per_device_train_batch_size: int = 1
    ...
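
The validator guidance can be sketched as follows, assuming Pydantic v2 (the version that provides field_validator and model_validator). `SchedulerConfig` and its fields are hypothetical and are not part of forgelm/config.py. The sketch pairs a single-field check with a whole-model invariant, and both raise ValueError with a message that says how to fix the YAML.

```python
from typing import Literal, Optional

from pydantic import BaseModel, ValidationError, field_validator, model_validator


class SchedulerConfig(BaseModel):  # hypothetical section, for illustration only
    kind: Literal["linear", "cosine"] = "linear"
    warmup_steps: int = 0  # safe when the key is omitted from YAML
    total_steps: Optional[int] = None

    @field_validator("warmup_steps")
    @classmethod
    def _warmup_non_negative(cls, value: int) -> int:
        # Reject instead of clamping: no silent coercion.
        if value < 0:
            raise ValueError(f"warmup_steps must be >= 0, got {value}")
        return value

    @model_validator(mode="after")
    def _warmup_fits_schedule(self) -> "SchedulerConfig":
        # Whole-model invariant: runs once all fields are validated.
        if self.total_steps is not None and self.warmup_steps > self.total_steps:
            raise ValueError(
                f"warmup_steps ({self.warmup_steps}) exceeds total_steps ({self.total_steps}); "
                "lower warmup_steps or raise total_steps"
            )
        return self


try:
    SchedulerConfig(warmup_steps=50, total_steps=10)
except ValidationError as exc:
    print(exc.errors()[0]["msg"])
```

Pydantic wraps the ValueError raised inside a validator in a ValidationError, so callers catch ValidationError at the config-loading boundary.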

Comments

Default to no comments. Only add them when the why is non-obvious:

A non-trivial workaround → name the upstream bug or issue.
A deliberately surprising default → say why.
A TODO with an owner and condition.

Never write comments that restate the code. Never write comments that reference a specific PR, issue, or past fix — that belongs in git history or the changelog.

NOSONAR discipline

Suppressing a SonarCloud rule with # NOSONAR is allowed only when the rule's premise genuinely does not bite, and the suppression must be auditable. Every new # NOSONAR site must satisfy:

The Sonar rule code on the same line. Write # NOSONAR python:S5332 for a plain-HTTP rejection-message string, # NOSONAR python:S5852 for a regex over controlled (non-user-facing) input, and so on. Bare # NOSONAR without the rule code is rejected — Sonar treats unscoped suppressions as low-confidence and may continue flagging.
A one-line rationale. Either trailing on the same line (terse) or in a prose block immediately above the offending line. The rationale must explain why the rule's premise does not bite — e.g. "rejection-path message, not an outbound HTTP call" or "controlled static-HTML input, not user-facing".
Combined Sonar + bandit suppressions on a single line. Prefer # NOSONAR python:S5332 # nosec B603 — rationale over two stacked comments on adjacent lines.
BLE001 + NOSONAR pairing. except Exception sites that already carry # noqa: BLE001 — <rationale> per error-handling.md may trail the line with # NOSONAR to silence Sonar's broad-catch warning; the BLE001 rationale doubles as the NOSONAR rationale, but the explicit Sonar rule code (python:S5754) should still be present.

Examples grounded in shipped code:

# Good — outbound-HTTP rule muted because the string is a rejection message:
raise ValueError(
    f"http:// blocked (use https://); url={_mask_netloc(url)}"  # NOSONAR python:S5332
)

# Good — broad-catch + Sonar pairing on a BLE001 site:
except Exception as e:  # noqa: BLE001 — best-effort: lm-eval HFLM wrapper crosses a wide error tail (HF introspection AttributeError, tokenizer ValueError, CUDA RuntimeError); BenchmarkResult(passed=False) is the documented gate surface.  # NOSONAR python:S5754

# NOSONAR is never a substitute for fixing the underlying issue. If a Sonar rule fires on input you control (e.g. a regex consuming user-supplied text), prefer rewriting to satisfy the rule (state machine, anchored regex, narrower exception class) over suppressing it.

Anti-patterns (rejected at review)

Concrete sins ForgeLM avoids — distilled from prior PR-cycle audits and external-repo comparisons:

Anti-pattern	Why rejected	Correct form
Silent import fallback: `try: import X; except: X = None`	Breaks type hints, hides missing deps	Optional extras with explicit `ImportError` + install hint
`|| true` in CI
`torch.CUDA` (actual typo seen in another repo)	Runtime error disguised as config	Use `torch.cuda` always
Zero-byte files	Git noise	Delete or fill with minimal valid content
Hypothetical file paths in docs	Drift between docs and code	Every cited path must exist, verified by a CI check
Placeholder stubs marked "v2.0 Ready"	False advertising	`NotImplementedError("Planned for Phase N")` with issue link
`[A-Za-z0-9_]` in regex	Verbose; SonarCloud `python:S6353`	`\w`
`[ ]{0,3}` (single-char class)	Noisy; SonarCloud `python:S6328`	`{0,3}`
Two competing greedy/lazy quantifiers over the same char class (`[ \t]+(.+?)[ \t]*$`)	O(n²) ReDoS, confirmed at `n=2000` in `_MARKDOWN_HEADING_PATTERN` (review round 2.5)	Anchor on `\S` at body boundaries: `[ \t]+(\S(?:[^\n]*\S)?)[ \t]*$`
`.*?` + back-reference + `re.DOTALL`	SonarCloud `python:S5852`; replace with state machine	Per-line walker (see `_strip_code_fences`)
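
To illustrate the `\S`-anchoring row, the sketch below compares the ambiguous pattern with an anchored form that has both `*` quantifiers written out. The `^#{1,6}` heading prefix and the names `AMBIGUOUS`, `ANCHORED`, and `heading_bodies` are assumptions added for the example; the real `_MARKDOWN_HEADING_PATTERN` may differ. The sketch checks only that both forms capture the same heading body. It does not measure running time.

```python
import re
from typing import List

# Assumption: the "#{1,6}" prefix is illustrative, not copied from forgelm.
AMBIGUOUS = re.compile(r"^#{1,6}[ \t]+(.+?)[ \t]*$", re.MULTILINE)
# The body must start and end on a non-space character, so the trailing
# "[ \t]*" no longer competes with the body for the same whitespace.
ANCHORED = re.compile(r"^#{1,6}[ \t]+(\S(?:[^\n]*\S)?)[ \t]*$", re.MULTILINE)


def heading_bodies(pattern: "re.Pattern[str]", text: str) -> List[str]:
    """Return the captured heading text for every heading line in ``text``."""
    return [match.group(1) for match in pattern.finditer(text)]


sample = "# Title\n##  Hello world  \t\nplain line\n### X\n"
assert heading_bodies(AMBIGUOUS, sample) == heading_bodies(ANCHORED, sample)
```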

For deeper regex rules (8 hard rules + ReDoS exposure budget + test fixture hygiene), see regex.md.
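
The correct form for the silent-import-fallback row in the table above could look like the sketch below. `require_extra` is a hypothetical helper, not a forgelm API, and the install hint reuses the `pip install -e '.[dev]'` form from the Tooling section with the extra name substituted.

```python
import importlib
from types import ModuleType


def require_extra(module_name: str, extra: str) -> ModuleType:
    """Import an optional dependency or fail with an install hint.

    Args:
        module_name: Importable module name.
        extra: Name of the extras group that provides it (hypothetical here).

    Raises:
        ImportError: If the module is missing; the message names the extra.
    """
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        # Fail where the feature is used instead of leaving a None behind.
        raise ImportError(
            f"'{module_name}' is required for this feature; install with: pip install -e '.[{extra}]'"
        ) from exc
```

Calling the helper at the point of use keeps the module importable without the extra, while a missing dependency still surfaces as an explicit, actionable error instead of an AttributeError on None later on.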

When you break these rules

You don't. Ruff will catch the style ones. Reviewers will catch the rest. If a rule actively blocks a legitimate change, open a PR here first to update the standard — with reasoning.
