hc ced50ea2b0 feat(agent): add result-verifier for blind visual comparison

Root cause: test-runner was giving overly optimistic results due to:
1. Context bias - knew the implementation, tended to defend it
2. No actual visual comparison - just wrote 'ACCEPTABLE' without looking
3. No structural validation - accepted 35x scale differences as 'acceptable'

Solution:
- New result-verifier agent that performs blind visual comparison
- Strict pass/fail criteria for structural validation
- Updated test-runner to use result-verifier for each figure
- Clear guidelines: structural mismatches = FAIL, not ACCEPTABLE

Test result: verifier correctly identified Fig3 as FAIL with 7 specific issues:
- Wrong X-axis variable (channels vs power)
- Wrong Y-axis scale (5x difference)
- Wrong curve count (5 vs 4)
- etc.

2026-03-31 23:56:36 +08:00

5.4 KiB

Raw Blame History

Paper Structure Analysis

Basic Information

Title: Resource Allocation for Text Semantic Communications
Authors: Lei Yan, Zhijin Qin, Rui Zhang, Yongzhao Li, Geoffrey Ye Li
Year: 2022
Venue: IEEE Wireless Communications Letters

Abstract Summary

This paper introduces semantic spectral efficiency (S-SE) as a new metric to measure communication efficiency from a semantic perspective. Taking text semantic communication (using DeepSC) as an example, the authors formulate and solve a resource allocation problem to maximize overall S-SE via channel assignment and semantic symbol length optimization. A transform method is also proposed for fair comparison between bit-based and semantic-based communication systems.

Problem Statement

Conventional communications use bit-based spectral efficiency, which is not applicable for semantic communications as bits are irrelevant to the meaning of the source. Resource allocation needs to be rethought from the semantic perspective to maximize communication efficiency while guaranteeing transmission reliability in semantic-aware networks.

Key Contributions

Proposing a novel resource allocation model for semantic-aware networks by defining Semantic Spectral Efficiency (S-SE) for the first time.
Formulating and solving an optimization problem to maximize overall S-SE in terms of channel assignment and the number of transmitted semantic symbols.
Developing a transform method to convert bit-based SE to S-SE to make fair comparisons between semantic and conventional communication systems.

Method Overview

Architecture

The system consists of a cellular network with a base station and multiple users. DeepSC is adopted as the semantic communication model for text transmission, utilizing Transformer architecture to map sentences to semantic symbols. The symbol vector length varies based on the sentence length and the average number of semantic symbols per word (k_n). The receiver decodes the symbols using a channel decoder and semantic decoder, evaluated by BERT-level semantic similarity. Reference to Figure 1: The structure of semantic-aware networks from image_understanding.md.

Key Components

Component	Description	Implementation Priority
Semantic Spectral Efficiency (S-SE) Metric	Defines the effectively transmitted semantic information over a unit of bandwidth.	High
DeepSC Transmitter/Receiver	Transformer-based semantic encoder/decoder.	Low (Pre-trained look-up table used)
Resource Allocation Optimizer	Solves for optimal `k_n` and channel assignment `\alpha_{n,m}` using exhaustive search and the Hungarian algorithm.	High
Transform Method & Baselines	Converts conventional bit-based SE to S-SE based on a transforming factor `\mu` (bits/word). Evaluates Ideal, 4G, and 5G baselines.	High

Mathematical Formulation

S-R (Semantic transmission rate) and S-SE (Semantic spectral efficiency):


\Gamma_{n, m} = \frac{W I}{k_n L} \xi_{n, m}


\Phi_{n, m} = \frac{\Gamma_{n, m}}{W} = \frac{I}{k_n L} \xi_{n, m}

Optimization Objective (P0/P1) to maximize total S-SE:


\max_{\boldsymbol{\alpha}_n, k_n} \widetilde{\Phi} = \sum_{n=1}^{N} \sum_{m=1}^{M} \alpha_{n, m} \frac{\xi_{n, m}}{k_n}

Training Details

Optimizer: DeepSC is pre-trained; the resource allocation uses Hungarian algorithm & exhaustive search.
Hardware: Assumed pre-trained at the BS or cloud platforms.
Note: The paper abstracts the DeepSC performance into a mapping between \xi_{n,m} (semantic similarity), k_n, and SNR \gamma_{n,m} over an AWGN channel.

Experiments

Datasets

Dataset	Size	Purpose
Synthetic Cellular Network	N=5, M=5 (default)	Resource allocation optimization simulation
Text dataset (implicit)	N/A	To obtain DeepSC semantic similarity performance look-up table

Metrics

Semantic similarity (\xi): Evaluated using pre-trained Sentence-BERT model.
Semantic Spectral Efficiency (S-SE): Measured in suts/s/Hz.
Transforming factor (\mu): Measured in bits/word.

Key Results

The proposed resource allocation model maximizes S-SE, significantly outperforming conventional models with fixed k_n.
The semantic communication system achieves higher S-SE than 4G and 5G systems for text transmission when the transforming factor \mu > 19 bits/word.
It outperforms the ideal Shannon limit system when \mu > 27 bits/word. Reference to Figure 3, Figure 4a, Figure 4b, Figure 4c from image_understanding.md.

Appendix Notes

No supplementary material findings explicitly stated, but baseline implementation details refer to 3GPP TS 36.213 (4G) and TS 38.214 (5G) tables for CQI to SE mapping.

Data Source Labeling

Figure 3: S-SE with different models

Data Point	Value	Source	Reliability
S-SE Proposed Model Plateau	~1.2	Image extraction	REFERENCE ONLY

Figure 4a, 4b, 4c: Comparison curves

Data Point	Value	Source	Reliability
S-SE Semantic (M>=5)	~1.2	Image extraction	REFERENCE ONLY
Semantic transmit power plateau	~10 dBm	Image extraction	REFERENCE ONLY
`\mu` cross point (Semantic vs 5G/4G)	19	Paper text, Section IV.C	HIGH
`\mu` cross point (Semantic vs Ideal)	27	Paper text, Section IV.C	HIGH

5.4 KiB Raw Blame History