Methodology by yeaight7 · Pull Request #27 · yeaight7/TFG_CYBER_AI

yeaight7 · 2026-05-17T22:59:02Z

This pull request significantly expands and clarifies the methodology chapter, adding detailed explanations of the experimental design, feature schema, reward structure, algorithmic choices, and limitations. The changes improve transparency, reproducibility, and interpretability of the RL-based network intrusion detection pipeline. The most important changes are grouped and summarized below:

Feature Schema and Data Handling

Added Table~\ref{tab:canonical-groups} and detailed description of the canonical flow feature groups, clarifying the structure and meaning of the 76-feature schema used throughout the pipeline.
Expanded diagnostics for laboratory flow inference, including new outputs for missingness, feature scaling statistics, and out-of-distribution detection.

Reward Function and Environment Protocol

Introduced a reward decision matrix (Table~\ref{tab:reward-matrix}) and expanded discussion of the reward structure, including its effect on agent behavior and the rationale for zero true-negative reward.
Added a detailed protocol for environment state transitions, clarifying how training and evaluation modes are separated and how reproducibility is ensured.

Algorithmic Details and Baseline Comparison

Added sections on QRDQN’s quantile regression loss and exploration strategy, explaining the distributional RL approach and its implications for cost-sensitive learning.
Documented class imbalance handling and cost-sensitive weighting for the Random Forest baseline, ensuring fair comparison with the RL agent under asymmetric costs.
Introduced a training-scale and data-efficiency protocol, describing how learning curves are generated and compared across models and budgets.

Validation and Reproducibility

Expanded the validation ladder with explanations for each rung, highlighting their roles in detecting leakage, measuring generalization, and testing domain shift.
Added a new section on implementation framework, including details on core libraries, artifact management, and reproducibility controls for experimental runs.

Methodological Limitations

Reorganized and elaborated the limitations section into four explicit categories: static environment, binary label mapping, benchmark fragility, and scope of reproducibility. This clarifies the boundaries of the thesis claims and future work required for stronger evidence.

These changes collectively make the methodology more rigorous, auditable, and easier to reproduce or extend.

…ema and reward matrix, improving clarity on flow features and agent decision-making process.

yeaight7 added 2 commits May 18, 2026 00:58

Enhance methodology chapter by adding detailed tables for feature sch…

5816790

…ema and reward matrix, improving clarity on flow features and agent decision-making process.

Implement feature X to enhance user experience and fix bug Y in module Z

b963669

yeaight7 self-assigned this May 17, 2026

yeaight7 requested review from Copilot and removed request for Copilot May 17, 2026 22:59

Merge branch 'main' into methodology

30a6aaf

Copilot started reviewing on behalf of yeaight7 May 17, 2026 22:59 View session

yeaight7 merged commit f8c488e into main May 17, 2026
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Methodology#27

Methodology#27
yeaight7 merged 3 commits into
mainfrom
methodology

yeaight7 commented May 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yeaight7 commented May 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant