# Test Fixtures

This directory contains HTML fixtures for parser tests.

## Fixture Strategy

- **Plan 02 (this plan):** All tests use inline HTML strings synthesized to match the
  gdqy.gov.cn page structure, so they run without network access. These fixtures are
  embedded directly in the test files rather than stored in this directory.

- **Plan 03 (next plan):** The smoke test (`test_smoke_gdqy.py`) will fetch
  `https://www.gdqy.gov.cn/gdqy/newxxgk/fgwj/szfwj/content/post_2136593.html`
  with patchright and save the result as `gdqy_post_2136593.html` for offline replay.
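
A minimal sketch of the record-then-replay flow planned for the smoke test, assuming patchright exposes the Playwright-compatible `sync_api` module. The helper names `fetch_with_patchright` and `load_or_record` are hypothetical and not part of the existing test suite; the URL and file name are the ones listed above.

```python
from pathlib import Path
from typing import Callable

URL = "https://www.gdqy.gov.cn/gdqy/newxxgk/fgwj/szfwj/content/post_2136593.html"
FIXTURE_NAME = "gdqy_post_2136593.html"


def fetch_with_patchright(url: str) -> str:
    """Fetch the rendered page HTML (hypothetical helper).

    patchright is imported lazily so offline replay never needs it installed.
    """
    from patchright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch()
        try:
            page = browser.new_page()
            page.goto(url)
            return page.content()
        finally:
            browser.close()


def load_or_record(
    fixture_dir: Path,
    url: str = URL,
    name: str = FIXTURE_NAME,
    fetch: Callable[[str], str] = fetch_with_patchright,
) -> str:
    """Return the saved fixture if present; otherwise fetch once and save it."""
    path = fixture_dir / name
    if path.exists():
        # Offline replay: no network access needed.
        return path.read_text(encoding="utf-8")
    html = fetch(url)
    path.write_text(html, encoding="utf-8")
    return html
```

Passing `fetch` as a parameter keeps the network call replaceable, so the replay path can be exercised without a browser.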

## Synthesized HTML Structure (gdqy.gov.cn)

The synthetic HTML used in Plan 02 tests matches the known gdqy.gov.cn DOM conventions:

- Article title: `h1.article-title`
- Publication time: `span.time`
- Source: `span.source`
- Content: `div.article-content`
- Attachments: links inside the content container matched by file extension, e.g. `a[href$='.pdf']`
- List rows: `ul.list_news li`, each containing an `a` link and a `span.date`
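
For illustration, the conventions above could be realized as inline fixtures like the following. This is a sketch, not the strings used in the actual tests: the titles, dates, and paths are placeholder values, and `TagCollector` is a hypothetical stdlib-only stand-in for whatever selector engine the parser uses.

```python
from html.parser import HTMLParser

# Placeholder article page following the selectors listed above.
ARTICLE_HTML = """
<html><body>
  <h1 class="article-title">Sample notice title</h1>
  <span class="time">2024-01-02</span>
  <span class="source">Sample office</span>
  <div class="article-content">
    <p>Body text.</p>
    <a href="/files/sample.pdf">Attachment</a>
  </div>
</body></html>
"""

# Placeholder list page: each row has a link and a date.
LIST_HTML = """
<ul class="list_news">
  <li><a href="/content/post_1.html">First item</a><span class="date">2024-01-02</span></li>
  <li><a href="/content/post_2.html">Second item</a><span class="date">2024-01-03</span></li>
</ul>
"""


class TagCollector(HTMLParser):
    """Record (tag, class, href) for every start tag in the document."""

    def __init__(self):
        super().__init__()
        self.seen = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        self.seen.append((tag, attr_map.get("class"), attr_map.get("href")))


def collect(html: str) -> list:
    """Parse an HTML string and return the recorded start tags."""
    collector = TagCollector()
    collector.feed(html)
    return collector.seen
```

A quick sanity check such as `("h1", "article-title", None) in collect(ARTICLE_HTML)` confirms that a fixture still carries the expected structure before it is handed to the parser under test.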
