BigQuery Agent Analytics Plugin: 6 validated bugs (status reporting, schema upgrade, shutdown, PII, truncation, dead code) #4694

## Summary

Code review of the current `main` file at [`bigquery_agent_analytics_plugin.py`](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py) identified 6 issues, all validated against `HEAD` (`dd0851ac`).

---

## Priority Findings

### 1. High — LLM_ERROR and TOOL_ERROR events logged with `status="OK"` instead of `"ERROR"`

**Evidence:**
- `EventData.status` defaults to `"OK"` ([L1798](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L1798))
- `_log_event` writes `event_data.status` to the row ([L2539](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L2539))
- `on_model_error_callback` sets `error_message` but **not** `status` ([L2987–2996](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L2987))
- `on_tool_error_callback` sets `error_message` but **not** `status` ([L3107–3118](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L3107))

**Impact:** Dashboards and queries that filter on `status='ERROR'` miss `LLM_ERROR` and `TOOL_ERROR` rows, so they undercount failures.

**Suggested fix:** Set `status="ERROR"` in both error callbacks, and optionally enforce in `_log_event` when `event_type.endswith("_ERROR")`.
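
A minimal sketch of the optional `_log_event` safeguard, assuming a simplified `EventData` with only the two fields discussed here. `resolve_status` is a hypothetical helper name, not an existing function in the plugin:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class EventData:
    """Simplified stand-in for the plugin's EventData (two fields only)."""

    status: str = "OK"
    error_message: Optional[str] = None


def resolve_status(event_type: str, event_data: EventData) -> str:
    """Return the status to write to the row.

    Hypothetical helper: forces "ERROR" for any *_ERROR event type, so a
    callback that forgets to set status cannot produce an "OK" error row.
    """
    if event_type.endswith("_ERROR"):
        return "ERROR"
    return event_data.status
```

Enforcing this at the single write point would cover both callbacks and any `_ERROR` event type added later, at the cost of overriding whatever status a callback set explicitly.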

---

### 2. Medium — Auto schema upgrade only checks top-level columns; nested schema evolution not handled

**Evidence:**
- `_maybe_upgrade_schema` diffs only top-level field names ([L2157–2158](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L2157))
- The version label is stamped unconditionally, even if the full expected schema is not reached ([L2169–2172](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L2169))

**Impact:** Future nested field additions can cause a Write API schema mismatch while the version label already says "upgraded", which prevents the upgrade from being retried.

**Suggested fix:** Recursive schema diff for nested RECORD fields, or avoid stamping the new version if the full expected schema is not reached.
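
One way the recursive diff could look, sketched against a minimal `Field` stand-in rather than the real BigQuery schema class. `Field` and `missing_field_paths` are illustrative names, not part of the plugin:

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class Field:
    """Minimal stand-in for a schema field; RECORD fields carry sub-fields."""

    name: str
    field_type: str = "STRING"
    fields: Tuple["Field", ...] = ()


def missing_field_paths(
    expected: Sequence[Field], actual: Sequence[Field], prefix: str = ""
) -> List[str]:
    """Return dotted paths present in `expected` but absent from `actual`."""
    actual_by_name = {f.name: f for f in actual}
    missing: List[str] = []
    for exp in expected:
        path = f"{prefix}{exp.name}"
        act = actual_by_name.get(exp.name)
        if act is None:
            # Whole field (and everything under it) is missing.
            missing.append(path)
        elif exp.fields:
            # Field exists; descend into nested RECORD sub-fields.
            missing.extend(
                missing_field_paths(exp.fields, act.fields, prefix=f"{path}.")
            )
    return missing
```

Under this approach the version label would be stamped only when the diff against the table's schema after the update comes back empty.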

---

### 3. Medium — Multi-loop shutdown drains only current loop queue; other loops may lose buffered events

**Evidence:**
- Only the current loop calls `batch_processor.shutdown()` ([L2243–2244](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L2243))
- Other loops get raw `transport.close()` without draining ([L2247–2253](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L2247))

**Impact:** Cross-loop runs can silently drop pending rows at shutdown.

**Suggested fix:** Coordinate per-loop shutdown on owning loops (e.g. with `run_coroutine_threadsafe`), then close transports.
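
The coordination could be sketched as follows. `LoopWorker` is a hypothetical stand-in for the plugin's per-loop state, and its `shutdown()` coroutine plays the role of `batch_processor.shutdown()`; only `asyncio.run_coroutine_threadsafe` and the standard loop methods are real APIs here:

```python
import asyncio
import threading


class LoopWorker:
    """Hypothetical per-loop state: an event loop on its own thread plus
    a buffer of rows that have not been written yet."""

    def __init__(self):
        self.loop = asyncio.new_event_loop()
        self.pending = []
        self.flushed = []
        self.thread = threading.Thread(target=self.loop.run_forever, daemon=True)
        self.thread.start()

    async def shutdown(self):
        # Stand-in for batch_processor.shutdown(): drain buffered rows.
        self.flushed.extend(self.pending)
        self.pending.clear()


def shutdown_all(workers, timeout=5.0):
    """Drain every worker on its owning loop, then stop and close the loops."""
    futures = [
        asyncio.run_coroutine_threadsafe(w.shutdown(), w.loop) for w in workers
    ]
    for fut in futures:
        # Blocks the caller; raises on timeout or if the drain itself failed.
        fut.result(timeout=timeout)
    for w in workers:
        w.loop.call_soon_threadsafe(w.loop.stop)
        w.thread.join(timeout=timeout)
        if not w.loop.is_running():
            w.loop.close()
```

Because `fut.result()` blocks, this sketch has to be called from a thread that does not own any of the loops being drained. In the plugin, the current loop would presumably await its own shutdown directly and use this path only for the other loops.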

---

### 4. Medium — Session metadata/state logged without truncation or redaction, enabled by default

**Evidence:**
- `log_session_metadata` defaults to `True` ([L494](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L494))
- `session.state` is included directly in `_enrich_attributes` without any truncation ([L2444–2446](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L2444)), unlike `usage_metadata`, which goes through `_recursive_smart_truncate`

**Impact:** Session state can hold arbitrary application data, so logging it unfiltered by default risks PII leakage and oversized rows in BigQuery.

**Suggested fix:** Add caps/redaction controls (`max_session_state_bytes`, `redact_keys`, `allowlist_keys`) and apply `_recursive_smart_truncate` before serialization.
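
A rough sketch of what the redaction and size cap might look like. `sanitize_session_state` is a hypothetical function; `redact_keys` and `max_session_state_bytes` are the proposed option names from above, not existing config fields. The cap here replaces an oversized state with a small summary, which is a simpler substitute for `_recursive_smart_truncate`:

```python
import json
from typing import Any, Dict, FrozenSet

REDACTED = "[REDACTED]"


def sanitize_session_state(
    state: Dict[str, Any],
    redact_keys: FrozenSet[str] = frozenset(),
    max_session_state_bytes: int = 4096,
) -> Dict[str, Any]:
    """Redact listed keys at any depth, then cap the serialized size."""

    def scrub(value: Any) -> Any:
        if isinstance(value, dict):
            return {
                k: REDACTED if k in redact_keys else scrub(v)
                for k, v in value.items()
            }
        if isinstance(value, list):
            return [scrub(v) for v in value]
        return value

    cleaned = scrub(state)
    size = len(json.dumps(cleaned, default=str).encode("utf-8"))
    if size > max_session_state_bytes:
        # Too large to log: keep only a summary instead of the payload.
        return {
            "truncated": True,
            "original_bytes": size,
            "keys": sorted(str(k) for k in cleaned),
        }
    return cleaned
```

Redaction runs before the size check so that the summary path never has to inspect raw values.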

---

### 5. Low — Large system instruction strings not truncated in LlmRequest parsing path

**Evidence:**
- String `system_instruction` is directly assigned without truncation ([L1409–1410](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L1409))
- Other content paths apply truncation: dict/list via `_recursive_smart_truncate` ([L1422–1425](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L1422)), plain strings via `process_text()` ([L1426–1427](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L1426)), Content objects via `_parse_content_object` ([L1412–1416](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L1412))

**Impact:** Inconsistent size controls; string system prompts bypass all truncation/offloading, increasing row-size risk.

**Suggested fix:** Pass the string through `_truncate` (and optionally offload if configured) like other text content.
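
As an illustration only, the string branch could route through a truncation helper like the one below. `truncate_text` is a stand-in written for this sketch; the real `_truncate` may have a different signature and suffix, and `parse_system_instruction` is a hypothetical wrapper:

```python
from typing import Tuple


def truncate_text(
    text: str, max_len: int, suffix: str = "...[TRUNCATED]"
) -> Tuple[str, bool]:
    """Return (possibly shortened text, whether truncation happened).

    A negative max_len is treated as "no limit" in this sketch.
    """
    if max_len < 0 or len(text) <= max_len:
        return text, False
    return text[:max_len] + suffix, True


def parse_system_instruction(system_instruction: str, max_len: int) -> dict:
    """Hypothetical string branch: truncate instead of assigning directly."""
    text, is_truncated = truncate_text(system_instruction, max_len)
    return {"system_prompt": text, "is_truncated": is_truncated}
```

Returning the truncation flag alongside the text lets the caller record that the logged prompt is incomplete, as the other content paths can.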

---

### 6. Low — `_HITL_TOOL_NAMES` is unused dead code

**Evidence:**
- Declared at [L81](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L81) with zero references elsewhere in the file
- The related `_HITL_EVENT_MAP` ([L86](https://github.com/google/adk-python/blob/main/src/google/adk/plugins/bigquery_agent_analytics_plugin.py#L86)) is used, but `_HITL_TOOL_NAMES` itself is never referenced

**Suggested fix:** Remove, or wire into validation/filtering logic.

---

## Feature Requests That Would Improve Reliability

1. **Plugin health telemetry counters**: `queue_dropped_count`, `batch_retry_count`, `batch_drop_count`, `offload_fail_count`
2. **Dead-letter sink** for failed batches (GCS/BQ table) with sampled payload/error context
3. **Conformance tests** for invariants: `_ERROR` events must produce `status='ERROR'`, schema upgrade handles nested additive changes, and multi-loop shutdown drains all queues
