Files
ccdi/docs/plans/backend/2026-03-18-lsfx-mock-large-transaction-backend-implementation.md

422 lines
13 KiB
Markdown
Raw Normal View History

# LSFX Mock Large Transaction Backend Implementation Plan
> **For agentic workers:** REQUIRED: Use superpowers:subagent-driven-development (if subagents available) or superpowers:executing-plans to implement this plan. Steps use checkbox (`- [ ]`) syntax for tracking.
**Goal:** 调整 `lsfx-mock-server` 的银行流水生成逻辑,使同一 `logId` 的分页流水稳定包含可命中后端大额交易 8 条规则的样本,同时保留随机噪声流水。
**Architecture:** 在 `lsfx-mock-server` 内新增独立的大额交易样本生成模块,维护默认身份池、阈值常量与 5 组规则样本簇;`statement_service.py` 负责调用样本生成器、补足噪声流水、统一分配流水 ID 并缓存分页结果;测试分成样本生成测试和 API 返回测试两层,最后补一份实施记录文档。
**Tech Stack:** Python 3, FastAPI, pytest, httpx TestClient
---
### Task 1: 固化样本生成边界与文件结构
**Files:**
- Create: `lsfx-mock-server/services/statement_rule_samples.py`
- Modify: `lsfx-mock-server/services/statement_service.py`
- Test: `lsfx-mock-server/tests/test_statement_service.py`
- [ ] **Step 1: Write the failing test**
先新增一个最小单测,约束后续服务层会调用“命中样本 + 噪声流水”两段式生成,而不是继续只走单条完全随机逻辑:
```python
from services.statement_service import StatementService
def test_generate_statements_should_include_seeded_samples_before_noise():
service = StatementService()
statements = service._generate_statements(group_id=1000, log_id=20001, count=30)
assert len(statements) >= 30
assert any(item["userMemo"] == "购买房产首付款" for item in statements)
```
- [ ] **Step 2: Run test to verify it fails**
Run:
```bash
cd lsfx-mock-server
pytest tests/test_statement_service.py -k seeded_samples -v
```
Expected:
- `FAIL`
- 原因是 `tests/test_statement_service.py` 尚不存在,且 `StatementService` 还没有命中样本生成能力
- [ ] **Step 3: Write minimal implementation**
先搭文件骨架和最小接口,不一次性塞完整逻辑,但要保证首个样本断言可以通过:
```python
# lsfx-mock-server/services/statement_rule_samples.py
DEFAULT_LARGE_TRANSACTION_THRESHOLDS = {
"SINGLE_TRANSACTION_AMOUNT": 1111,
"CUMULATIVE_TRANSACTION_AMOUNT": 50000001,
"ANNUAL_TURNOVER": 50000001,
"LARGE_CASH_DEPOSIT": 2000001,
"FREQUENT_CASH_DEPOSIT": 5,
"FREQUENT_TRANSFER": 100001,
}
def build_large_transaction_seed_statements(group_id: int, log_id: int) -> list[dict]:
return [
{
"groupId": group_id,
"batchId": log_id,
"userMemo": "购买房产首付款",
"customerName": "杭州贝壳房地产经纪有限公司",
"drAmount": 680000.0,
"crAmount": 0.0,
"cashType": "对公转账",
"cretNo": "330101198801010011",
}
]
```
```python
# lsfx-mock-server/services/statement_service.py
from services.statement_rule_samples import build_large_transaction_seed_statements
def _generate_statements(self, group_id: int, log_id: int, count: int) -> List[Dict]:
statements = build_large_transaction_seed_statements(group_id, log_id)
while len(statements) < count:
statements.append(self._generate_random_statement(len(statements), group_id, log_id))
return statements
```
- [ ] **Step 4: Run test to verify it passes**
Run:
```bash
cd lsfx-mock-server
pytest tests/test_statement_service.py -k seeded_samples -v
```
Expected:
- `PASS`
- 说明样本生成器与服务层的最小整合入口已经建立
- [ ] **Step 5: Commit**
```bash
git add lsfx-mock-server/services/statement_rule_samples.py lsfx-mock-server/services/statement_service.py lsfx-mock-server/tests/test_statement_service.py
git commit -m "拆分Mock大额交易样本生成骨架"
```
### Task 2: 用单元测试锁定 8 条规则样本口径
**Files:**
- Modify: `lsfx-mock-server/services/statement_rule_samples.py`
- Test: `lsfx-mock-server/tests/test_statement_service.py`
- [ ] **Step 1: Write the failing test**
`tests/test_statement_service.py` 中补一组口径测试,至少覆盖 8 条规则的关键命中特征:
```python
from services.statement_rule_samples import build_large_transaction_seed_statements
def test_large_transaction_seed_should_cover_all_eight_rules():
statements = build_large_transaction_seed_statements(group_id=1000, log_id=20001)
assert any(item["userMemo"] == "购买房产首付款" and item["drAmount"] > 0 for item in statements)
assert any("税" in item["userMemo"] and item["drAmount"] > 0 for item in statements)
assert any(item["crAmount"] > 1111 for item in statements)
assert len([item for item in statements if item["cashType"] == "现金存款"]) >= 1
assert len([item for item in statements if item["customerName"] == "浙江远望贸易有限公司"]) >= 3
```
再补几个针对性断言:
- 同一身份证、同一日期,至少 6 笔 `crAmount > 2000001` 的存现样本
- 至少 1 笔 `userMemo``手机银行转账``drAmount > 100001`
- 所有“收入样本”都避开工资排除词
- [ ] **Step 2: Run test to verify it fails**
Run:
```bash
cd lsfx-mock-server
pytest tests/test_statement_service.py -k "eight_rules or cash_deposit or transfer" -v
```
Expected:
- `FAIL`
- 原因是样本构造函数还没有真实返回 8 条规则所需数据
- [ ] **Step 3: Write minimal implementation**
`statement_rule_samples.py` 中先把 5 组样本簇写实:
```python
IDENTITY_POOL = {
"staff_primary": {"name": "模型测试员工", "id_card": "330101198801010011", "account": "6222024000000001"},
"family_primary": {"name": "模型测试家属", "id_card": "330101199001010022", "account": "6222024000000002"},
"staff_secondary": {"name": "模型二测试员工", "id_card": "330101198802020033", "account": "6222024000000003"},
"family_secondary": {"name": "模型二测试家属", "id_card": "330101199202020044", "account": "6222024000000004"},
}
```
```python
def build_large_transaction_seed_statements(group_id: int, log_id: int) -> list[dict]:
statements = []
statements.extend(build_house_or_car_samples(group_id, log_id))
statements.extend(build_tax_samples(group_id, log_id))
statements.extend(build_income_samples(group_id, log_id))
statements.extend(build_cash_deposit_samples(group_id, log_id))
statements.extend(build_large_transfer_samples(group_id, log_id))
return statements
```
- [ ] **Step 4: Run test to verify it passes**
Run:
```bash
cd lsfx-mock-server
pytest tests/test_statement_service.py -v
```
Expected:
- `PASS`
- 测试输出能证明 8 条规则所需的关键样本全部存在
- [ ] **Step 5: Commit**
```bash
git add lsfx-mock-server/services/statement_rule_samples.py lsfx-mock-server/tests/test_statement_service.py
git commit -m "补充Mock大额交易八条规则样本"
```
### Task 3: 重构 StatementService 合并样本与噪声流水
**Files:**
- Modify: `lsfx-mock-server/services/statement_service.py`
- Test: `lsfx-mock-server/tests/test_statement_service.py`
- [ ] **Step 1: Write the failing test**
补两个服务层测试:
```python
def test_generate_statements_should_fill_noise_up_to_requested_count():
service = StatementService()
statements = service._generate_statements(group_id=1000, log_id=20001, count=80)
assert len(statements) == 80
def test_get_bank_statement_should_keep_same_cached_result_for_same_log_id():
service = StatementService()
page1 = service.get_bank_statement({"groupId": 1000, "logId": 30001, "pageNow": 1, "pageSize": 20})
page2 = service.get_bank_statement({"groupId": 1000, "logId": 30001, "pageNow": 1, "pageSize": 20})
assert page1["data"]["bankStatementList"] == page2["data"]["bankStatementList"]
```
- [ ] **Step 2: Run test to verify it fails**
Run:
```bash
cd lsfx-mock-server
pytest tests/test_statement_service.py -k "fill_noise or cached_result" -v
```
Expected:
- `FAIL`
- 原因是当前 `_generate_statements()` 会直接按 `count` 循环生成单条随机流水,既不保底命中样本,也没有样本与噪声的装配逻辑
- [ ] **Step 3: Write minimal implementation**
`statement_service.py` 改成以下结构:
```python
def _generate_statements(self, group_id: int, log_id: int, count: int) -> List[Dict]:
seed_statements = build_large_transaction_seed_statements(group_id, log_id)
total_count = max(count, len(seed_statements))
noise_count = total_count - len(seed_statements)
statements = list(seed_statements)
for index in range(noise_count):
statements.append(self._generate_random_statement(index, group_id, log_id))
statements = self._assign_statement_ids(statements, log_id)
random.shuffle(statements)
return statements
```
同时新增一个专用方法,为样本和噪声统一补齐:
- `bankStatementId`
- `bankTrxNumber`
- `uploadSequnceNumber`
- `batchId`
- `groupId`
- [ ] **Step 4: Run test to verify it passes**
Run:
```bash
cd lsfx-mock-server
pytest tests/test_statement_service.py -v
```
Expected:
- `PASS`
- 同一 `logId` 首次生成后被缓存,后续分页请求不会漂移
- [ ] **Step 5: Commit**
```bash
git add lsfx-mock-server/services/statement_service.py lsfx-mock-server/tests/test_statement_service.py
git commit -m "重构Mock流水生成并保留分页缓存一致性"
```
### Task 4: 扩展 API 测试覆盖分页与命中特征
**Files:**
- Modify: `lsfx-mock-server/tests/test_api.py`
- Modify: `lsfx-mock-server/tests/integration/test_full_workflow.py`
- [ ] **Step 1: Write the failing test**
在 API 层增加最小回归:
```python
def test_get_bank_statement_should_return_seed_samples(client):
response = client.post(
"/watson/api/project/getBSByLogId",
data={"groupId": 1000, "logId": 40001, "pageNow": 1, "pageSize": 200}
)
assert response.status_code == 200
items = response.json()["data"]["bankStatementList"]
assert any(item["userMemo"] == "购买房产首付款" for item in items)
```
再补一个分页拼接测试,确认把前几页拼起来后可以找到:
- 税务样本
- 至少 6 笔同日存现
- 大额转账样本
- [ ] **Step 2: Run test to verify it fails**
Run:
```bash
cd lsfx-mock-server
pytest tests/test_api.py -k "seed_samples or bank_statement" -v
```
Expected:
- `FAIL`
- 原因是接口返回的仍是完全随机流水,无法稳定断言命中特征
- [ ] **Step 3: Write minimal implementation**
这里主要是补充测试,不引入新的 API 逻辑分支。若前一任务已完成,接口应天然满足这些断言;如仍失败,只允许回到 `statement_rule_samples.py``statement_service.py` 修正数据构造,不要在路由层堆业务判断。
- [ ] **Step 4: Run test to verify it passes**
Run:
```bash
cd lsfx-mock-server
pytest tests/test_api.py -v
pytest tests/integration/test_full_workflow.py -v
```
Expected:
- `PASS`
- 现有上传、拉取、查询等主流程回归不受影响
- [ ] **Step 5: Commit**
```bash
git add lsfx-mock-server/tests/test_api.py lsfx-mock-server/tests/integration/test_full_workflow.py
git commit -m "补充Mock流水分页与命中特征回归测试"
```
### Task 5: 补实施记录并完成最终验证
**Files:**
- Create: `docs/reports/implementation/2026-03-18-lsfx-mock-large-transaction-implementation.md`
- Modify: `docs/plans/backend/2026-03-18-lsfx-mock-large-transaction-backend-implementation.md`
- [ ] **Step 1: Write the implementation report skeleton**
记录以下内容:
- 改动文件清单
- 8 条规则对应的样本策略
- 实际跑过的 pytest 命令
- 已知约束:身份池必须存在于目标后端环境
建议骨架:
```markdown
# LSFX Mock 大额交易样本实施记录
## 变更概述
- ...
## 验证记录
- `cd lsfx-mock-server && pytest tests/test_statement_service.py -v`
- `cd lsfx-mock-server && pytest tests/test_api.py -v`
```
- [ ] **Step 2: Run final verification**
Run:
```bash
cd lsfx-mock-server
pytest tests/test_statement_service.py -v
pytest tests/test_api.py -v
pytest tests/integration/test_full_workflow.py -v
python verify_implementation.py
```
Expected:
- 所有测试通过
- `verify_implementation.py` 不报接口字段缺失
- [ ] **Step 3: Stop any started services**
如果验证过程中启动了:
- `python main.py`
- `uvicorn main:app --reload --host 0.0.0.0 --port 8000`
必须在结束时关闭对应进程,避免残留端口占用。
- [ ] **Step 4: Commit**
```bash
git add docs/reports/implementation/2026-03-18-lsfx-mock-large-transaction-implementation.md docs/plans/backend/2026-03-18-lsfx-mock-large-transaction-backend-implementation.md
git commit -m "补充Mock大额交易样本实施记录"
```