Implement text extraction from raw HTML + client-side rendering (CSR) detection in QA #992

Draft
emma-sg wants to merge 10 commits into main from qa-csr-clues-and-text-compare

emma-sg commented Mar 11, 2026

Adds an option (--qaDetectClientSideRendering) that analyzes replayed HTML content for CSR clues, such as framework indicators, hydration data, web components, and DOM manipulation patterns, in order to identify client-side rendered pages. The output is written to the info WARCs along with the rest of the page info (pages, comparison info, etc.).

Will follow up with analysis of the extracted text, both with and without additional page resources.
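
For readers unfamiliar with the four clue categories named above, here is a minimal, dependency-free sketch of how such a check could look. It is not the code in this PR: the function names (`detectCsrClues`, `hasCsrClues`), the `CsrClues` shape, and every regular expression are hypothetical illustrations, and the PR itself adds jsdom, so its real implementation likely inspects a parsed DOM rather than matching on the raw string.

```typescript
// Hypothetical sketch: none of these names or patterns are taken from the PR.
interface CsrClues {
  frameworkIndicators: string[];
  hydrationData: string[];
  webComponents: string[];
  domManipulation: string[];
}

// Illustrative markers that common frameworks leave in rendered markup.
const FRAMEWORK_PATTERNS: { [name: string]: RegExp } = {
  react: /data-reactroot|data-reactid/i,
  angular: /\bng-version=/i,
  vue: /\bdata-v-app\b|\bdata-v-[0-9a-f]{8}\b/i,
};

// Illustrative markers for serialized state that a client bundle hydrates from.
const HYDRATION_PATTERNS: { [name: string]: RegExp } = {
  nextData: /<script[^>]+id=["']__NEXT_DATA__["']/i,
  nuxtData: /window\.__NUXT__/,
  initialState: /window\.__INITIAL_STATE__/,
};

// Illustrative markers for inline scripts that build the DOM at runtime.
const DOM_PATTERNS: { [name: string]: RegExp } = {
  innerHTML: /\.innerHTML\s*=/,
  createElement: /document\.createElement\(/,
  appendChild: /\.appendChild\(/,
};

// Return the names of all patterns that match the HTML string.
function matchingNames(html: string, patterns: { [name: string]: RegExp }): string[] {
  return Object.keys(patterns).filter((name) => patterns[name].test(html));
}

// Custom element names must contain a hyphen, e.g. <my-widget>.
function findCustomElements(html: string): string[] {
  const re = /<([a-z][a-z0-9]*-[a-z0-9-]*)[\s>\/]/gi;
  const found: string[] = [];
  let m: RegExpExecArray | null;
  while ((m = re.exec(html)) !== null) {
    const name = m[1].toLowerCase();
    if (found.indexOf(name) === -1) found.push(name);
  }
  return found;
}

function detectCsrClues(html: string): CsrClues {
  return {
    frameworkIndicators: matchingNames(html, FRAMEWORK_PATTERNS),
    hydrationData: matchingNames(html, HYDRATION_PATTERNS),
    webComponents: findCustomElements(html),
    domManipulation: matchingNames(html, DOM_PATTERNS),
  };
}

// True if any category produced at least one clue.
function hasCsrClues(clues: CsrClues): boolean {
  return (
    clues.frameworkIndicators.length > 0 ||
    clues.hydrationData.length > 0 ||
    clues.webComponents.length > 0 ||
    clues.domManipulation.length > 0
  );
}
```

Keeping the four categories separate in the result, rather than collapsing them into a single boolean, would let a downstream consumer weigh them differently. For example, an inline `createElement` call is much weaker evidence of CSR than a hydration payload.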

emma-sg force-pushed the qa-csr-clues-and-text-compare branch from ba2edf1 to e40d96f on March 11, 2026 20:41
socket-security bot commented Mar 18, 2026

Review the following changes in direct dependencies. Learn more about Socket for GitHub.

| Diff | Package | Supply Chain Security | Vulnerability | Quality | Maintenance | License |
|---|---|---|---|---|---|---|
| Added | @types/jsdom@28.0.0 | 100 | 100 | 74 | 86 | 100 |
| Added | jsdom@29.0.0 | 75 | 100 | 100 | 96 | 100 |

View full report

emma-sg changed the title from "Implement client-side rendering (CSR) detection in QA" to "Implement text extraction from raw HTML + client-side rendering (CSR) detection in QA" on Mar 23, 2026
emma-sg added 7 commits March 23, 2026 14:13
adds option to analyze replayed HTML content to detect CSR clues
such as framework indicators, hydration data, web components, and
DOM manipulation patterns to identify client-side rendered pages

will follow with text analysis with and without additional page
resources
emma-sg force-pushed the qa-csr-clues-and-text-compare branch from 4d5e40d to dd299af on March 23, 2026 18:13