Name	Name	Last commit message	Last commit date
parent directory ..
django_14170	django_14170
pydata_xarray_6938	pydata_xarray_6938
README.md	README.md

Name

Last commit message

Last commit date

Case Study

This directory includes some case analysis. We compare the both method(grep + Claude Context semantic search) and the traditional grep only method.

These cases are selected from the Princeton NLP's SWE-bench_Verified dataset. The results and the logs are generated by the run_evaluation.py script. For more details, please refer to the evaluation README.md file.

📁 django_14170: Query optimization in YearLookup breaks filtering by "__iso_year"
📁 pydata_xarray_6938: .swap_dims() can modify original object

Each case study includes:

Original Issue: The GitHub issue description and requirements
Problem Analysis: Technical breakdown of the bug and expected solution
Method Comparison: Detailed comparison of both approaches
Conversation Logs: The interaction records showing how the LLM agent call the ols and generate the final answer.
Results: Performance metrics and outcome analysis

Key Results

Compared with traditional grep only, the both method(grep + Claude Context semantic search) is more efficient and accurate.