License: arXiv.org perpetual non-exclusive license
arXiv:2604.14524v2 [eess.SP] 23 Apr 2026

Bridging Standardized Codebook and Site-Specific Beamforming: A Unified Limited-Feedback Framework

Cheng-Jie Zhao, Zhaolin Wang, , Zongyao Zhao, , and Yuanwei Liu The authors are with Department of Electrical and Computer Engineering, The University of Hong Kong, Hong Kong (e-mail: chengjie_zhao@connect.hku.hk; zhaolin.wang@hku.hk; zongyao@hku.hk; yuanwei@hku.hk).
Abstract

A site-specific Type-II codebook design is proposed for downlink massive multiple-input multiple-output (MIMO) limited-feedback beamforming. The key idea is to embed a learned site-specific propagation prior into the Type-II channel state information (CSI) feedback pipeline. Specifically, the base station (BS) uses a low-overhead reference signal received power (RSRP) fingerprint collected during synchronization signal block (SSB) probing to infer a user equipment (UE)-dependent dominant beam subspace before explicit CSI acquisition. The UE then estimates and feeds back only the low-dimensional effective channel coefficients within this inferred subspace, thereby avoiding full-dimensional online subspace discovery while retaining a rich multi-beam representation capability. To analyze the proposed design and compare it with standardized feedback mechanisms, a unified subspace-projection framework is developed by jointly characterizing CSI acquisition, UE-side compression, BS-side reconstruction, and effective spectral efficiency. Under this framework, Type-I, Type-II, port-selection feedback, and the proposed scheme are interpreted as different ways of inducing a feedback representation subspace. The probing codebook and the BS-side subspace inference network are then formulated as a coupled task-oriented design problem and are optimized end-to-end by maximizing the normalized CSI-capture efficiency. Extensive simulation results demonstrate that the proposed feedback scheme achieves Type-II-comparable CSI-capture capability with substantially lower online overhead and UE-side complexity, thereby improving the effective spectral efficiency.

I Introduction

BEAMFORMING is a key enabling technology for modern cellular systems, especially in massive multiple-input multiple-output (MIMO) and high-frequency deployments where directional transmission is indispensable for compensating path loss and improving spectral efficiency [Heath2016mmWave]. In practical downlink (DL) systems, however, high-quality beamforming critically depends on how accurately the base station (BS) can acquire the user equipment (UE)-specific channel state information (CSI). This challenge is particularly pronounced in frequency-division-duplex (FDD) systems and in scenarios where reciprocity-based acquisition is imperfect or unavailable [Hassibi2003Training]. In such cases, the DL CSI must be learned through BS-side reference-signal (RS) transmission, UE-side channel estimation and compression, and subsequent CSI feedback. The resulting overhead can become substantial when the antenna dimension is large and the channel varies rapidly, which makes limited-feedback beamforming a fundamental yet challenging design problem [Love2008LimitedFeedback].
Current 3rd Generation Partnership Project (3GPP) New Radio (NR) systems address this problem through standardized codebook-based CSI feedback mechanisms specified in TS 38.214 [3GPP38214], among which Type-I, Type-II, and enhanced Type-II port-selection codebook (PSC) schemes constitute the main design options. These schemes offer different tradeoffs between signaling overhead, UE-side complexity, and beamforming accuracy. Type-I relies on single-beam quantization and therefore has the lowest feedback and computational cost, but its representation capability is limited in multipath environments. Type-II provides a more flexible multi-beam subspace representation and often achieves much better beamforming quality, but this gain comes at the price of full-dimensional CSI acquisition, heavier beam search, and higher UE-side processing and feedback overhead. PSC provides a structured alternative, but its representation capability is still constrained by the selected port domain. As the core codebook-based feedback schemes in NR, they have been extensively discussed in the broader literature on limited feedback, beam management, and 3GPP MIMO codebooks [Love2008LimitedFeedback, Giordani2019BeamManagementNR, Fu2023TutorialCodebooks].
Existing studies have mainly followed two separate lines. One line remains within the standardized or near-standardized codebook-based feedback framework and seeks to improve beam management or CSI feedback while preserving compatibility with practical NR signaling structures and feedback formats. In particular, the authors of [Dreifuerst2024MLCodebook] proposed a machine-learning-based codebook design for NR initial access and Type-II feedback . In a subsequent work, they further developed a neural codebook design framework for broader MIMO network beam-management tasks [Dreifuerst2025NeuralCodebook]. These works show that data-driven codebook design can improve practical beam management under standardized signaling constraints. However, they still treat Type-I, Type-II, and PSC largely within their individual formulations, and therefore do not provide a common analytical lens for systematically exposing the intrinsic overhead-complexity-performance tradeoffs among the conventional feedback mechanisms.
A separate line of research moves toward site-specific or environment-aware beamforming, where learned propagation structure or site-specific measurements are exploited for probing, beam alignment, or beam synthesis. For example, the authors of [Heng2022SiteSpecificProbing] studied site-specific probing and later developed a grid-free beam-alignment framework in [Heng2024GridFree] based on learned site-dependent measurements. The authors of [Ning2023RSRPCodebook] proposed an RSRP-based environment-adaptive codebook design , while the authors of [Wu2024CKMBeamforming] investigated environment-aware hybrid beamforming by leveraging channel knowledge maps. More recently, generative site-specific beamforming frameworks proposed in [sim, SSBFMAG2] have further shown that coarse environment-dependent observations can already be highly informative for downstream beamforming and subspace inference . However, these works usually depart from the current standardized limited-feedback architecture and therefore do not explain how site-specific designs should be systematically incorporated into a limited-feedback system, nor how the resulting gain should be quantified under the same overhead-efficiency metric as conventional feedback schemes.
Taken together, these two lines of research reveal a missing middle ground. Existing codebook-based feedback schemes preserve practical NR signaling structures, but they still rely on online dominant-subspace discovery from high-dimensional CSI at the UE, which leads to substantial CSI acquisition, feedback, and UE-side processing overhead. By contrast, site-specific and environment-aware beamforming works suggest that wireless propagation around a BS is often not arbitrary: persistent geometric features such as dominant reflectors, blockage patterns, and street layouts induce recurring low-dimensional channel structures that can be learned from historical or simulated site-specific channel data. In this paper, this learned propagation prior is referred to as site-specific information (SSI). However, existing site-specific methods usually fall outside the standardized limited-feedback architecture and therefore do not explain how such prior knowledge should be systematically incorporated into a practical limited-feedback system. This observation motivates using SSI not as a replacement for CSI feedback, but as prior information to assist the feedback process itself. Specifically, we develop an SSI-enhanced limited-feedback scheme in which the BS combines the learned site-specific prior with a low-overhead reference signal received power (RSRP) fingerprint collected during the synchronization signal block (SSB) probing stage to infer a UE-dependent dominant subspace before explicit CSI acquisition. The UE then only needs to estimate and feed back the low-dimensional effective CSI coefficients within that inferred subspace. In this way, the proposed design preserves the practical limited-feedback structure while reducing online CSI acquisition overhead and UE-side complexity. The main contributions are summarized as follows:

  • We develop a unified optimization framework for limited-feedback beamforming, where CSI acquisition, UE-side compression, BS-side reconstruction, and the resulting spectral efficiency are jointly characterized from a subspace-projection perspective. This framework also unifies the interpretation of standardized NR and the proposed schemes, enabling a theoretical analysis of their CSI-capture efficiency, overhead, and computational complexity tradeoffs.

  • We propose a site-specific Type-II feedback scheme, where the BS leverages offline learned SSI and a low-overhead RSRP fingerprint from SSB probing to infer a UE-dependent dominant beam subspace before explicit CSI acquisition. Based on this subspace, the UE only needs to estimate and feed back low-dimensional effective channel coefficients, thus retaining the representation advantage of Type-II while substantially reducing online CSI acquisition overhead.

  • We propose a learning-based end-to-end method for the joint design of the probing codebook and the BS-side subspace inference network. In particular, the two components are co-optimized to ensure that the probing stage generates highly informative RSRP fingerprints while the inference stage delivers accurate dominant-subspace predictions, thereby maximizing CSI-capture efficiency under limited overhead.

  • Extensive simulations show that the proposed framework achieves a significantly better overhead-efficiency tradeoff than conventional feedback schemes. In particular, the learned probing design consistently outperforms competing baselines, while the proposed site-specific Type-II scheme attains CSI-capture capability on par with Type-II with substantially lower online CSI acquisition overhead and UE-side complexity.

The remainder of this paper is organized as follows. Section II introduces the system model, including the geometric channel model and the SSB-based initial access procedure. Section III develops the unified optimization framework for limited-feedback beamforming. Section IV reinterprets the conventional Type-I, Type-II, and PSC mechanisms within this framework. Section V presents the proposed site-specific Type-II feedback scheme and its theoretical analysis. Section VI formulates the joint design of the site-specific probing codebook and the subspace inference rule and develops the corresponding learning-based solver. Section VII provides numerical results. Finally, Section VIII concludes the paper.
Notation: Scalars, vectors, matrices, and sets are denoted by italic letters, boldface lowercase letters, boldface uppercase letters, and calligraphic letters, respectively. ()T({\cdot})^{T}, ()H({\cdot})^{H}, and ()({\cdot})^{\dagger} denote the transpose, Hermitian transpose, and Moore–Penrose pseudoinverse, respectively. \|\cdot\| and |||\cdot| denote the Euclidean norm and the absolute value, respectively. span()\mathrm{span}(\cdot) denotes the subspace spanned by its argument, 𝐈N\mathbf{I}_{N} denotes the N×NN\times N identity matrix, and 𝔼[]\mathbb{E}[\cdot] denotes the expectation operator. 𝒞𝒩(μ,𝚺)\mathcal{CN}(\mu,\mathbf{\Sigma}) denotes the circularly symmetric complex Gaussian (CSCG) distribution with mean μ\mu and covariance 𝚺\mathbf{\Sigma}.

II System Model

We consider a single-cell DL system in which a BS equipped with NtN_{t} fully digital transmit antennas serves a single-antenna UE at a time. The UE is located on a two-dimensional (2D) plane, and the DL channel is assumed to be block fading, i.e., approximately constant within one coherence interval. The beam management and feedback procedures considered in this paper are carried out within such a coherence interval.

II-A Channel Model

We adopt a sparse geometric channel model, which represents the channel as a superposition of LL dominant propagation paths [Ayach2014SpatiallySparse]. Let 𝐡Nt×1\mathbf{h}\in\mathbb{C}^{N_{t}\times 1} denote the instantaneous DL channel vector. It is represented as

𝐡=l=1Lαl𝐚(φl)=𝐀𝜶,\mathbf{h}=\sum_{l=1}^{L}\alpha_{l}\mathbf{a}(\varphi_{l})=\mathbf{A}\bm{\alpha}, (1)

where αl\alpha_{l}\in\mathbb{C} denotes the complex gain of the ll-th path, including large-scale attenuation and small-scale fading, and 𝐚(φl)\mathbf{a}(\varphi_{l}) is the transmit-array steering vector associated with the angle of departure φl\varphi_{l}. We collect the steering vectors and path gains as 𝐀=[𝐚(φ1),,𝐚(φL)]Nt×L\mathbf{A}=[\mathbf{a}(\varphi_{1}),\ldots,\mathbf{a}(\varphi_{L})]\in\mathbb{C}^{N_{t}\times L} and 𝜶=[α1,,αL]TL×1\bm{\alpha}=[\alpha_{1},\ldots,\alpha_{L}]^{T}\in\mathbb{C}^{L\times 1}.

Assumption 1 (Near-orthogonal angular model).

The BS employs a uniform linear array (ULA) with inter-element spacing dd, and the UE lies in the far field of the array. For the ll-th path, define the spatial frequency as uldλsin(φl)u_{l}\triangleq\frac{d}{\lambda}\sin(\varphi_{l}), so that the normalized steering vector is

𝐚(ul)=1Nt[1,ej2πul,,ej2π(Nt1)ul]T.\mathbf{a}(u_{l})=\frac{1}{\sqrt{N_{t}}}\left[1,e^{j2\pi u_{l}},\cdots,e^{j2\pi(N_{t}-1)u_{l}}\right]^{T}. (2)

The dominant path spatial frequencies are sufficiently separated, and the array aperture is large enough to resolve them. Consequently, the off-diagonal correlations among the dominant steering vectors are small, and the array response matrix satisfies the approximate orthogonality condition

𝐀H𝐀𝐈L.\mathbf{A}^{H}\mathbf{A}\approx\mathbf{I}_{L}. (3)

Assumption 1 is the channel-structure condition used in the subsequent path-capture analysis. It implies that, in sparse site-specific environments, the channel energy can be approximately decomposed across a small number of resolvable angular components, which makes a subspace-capture interpretation meaningful. With a unit-power transmit beamformer 𝐰Nt×1\mathbf{w}\in\mathbb{C}^{N_{t}\times 1} and data symbol ss, the UE receives

y=Pt𝐡H𝐰s+n,y=\sqrt{P_{t}}\mathbf{h}^{H}\mathbf{w}s+n, (4)

where PtP_{t} is the transmit power and n𝒞𝒩(0,σn2)n\sim\mathcal{CN}(0,\sigma_{n}^{2}) is temporally independent additive noise.

II-B SSB-Based Initial Access and RSRP Fingerprint

Before CSI feedback, the UE first performs initial access through SSB beam sweeping. Specifically, the BS transmits a predefined SSB probing codebook 𝐁=[𝐛1,,𝐛K]Nt×K\mathbf{B}=[\mathbf{b}_{1},\ldots,\mathbf{b}_{K}]\in\mathbb{C}^{N_{t}\times K} [3GPP38211, 3GPP38215]. For the ii-th probing beam, the received SSB signal is modeled as

𝐲SSB,i=PSSB𝐡H𝐛i𝐬SSB+𝐧SSB,i,{{\mathbf{y}}_{{\rm{SSB}},i}}=\sqrt{P_{\text{SSB}}}\mathbf{h}^{H}\mathbf{b}_{i}{{\bf{s}}_{{\rm{SSB}}}}+{\bf{n}}_{{\rm{SSB}},i}, (5)

where PSSBP_{\text{SSB}} is the SSB transmit power, 𝐬SSBLs×1\mathbf{s}_{\rm SSB}\in\mathbb{C}^{L_{s}\times 1} collects the SSB symbols used for this measurement, and 𝐧SSB,i𝒞𝒩(𝟎Ls,σn2𝐈Ls)\mathbf{n}_{{\rm SSB},i}\sim\mathcal{CN}(\mathbf{0}_{L_{s}},\sigma_{n}^{2}\mathbf{I}_{L_{s}}) is the corresponding noise vector 111In NR, SSB measurements are obtained over specific time-frequency resource elements. For analytical compactness, we aggregate these resource elements into a vector and do not separately track their time-frequency indices.. The UE measures the RSRP of the ii-th beam by averaging the received SSB power over the aggregated SSB symbols,

RSRP𝐛i1Lst=1Ls|ySSB,i(t)|2.\text{RSRP}_{\mathbf{b}_{i}}\triangleq\frac{1}{L_{s}}\sum_{t=1}^{L_{s}}\left|y_{{\rm SSB},i}^{(t)}\right|^{2}. (6)

Following [sim], the dB-domain RSRP can be modeled as a noisy observation of an ideal beam power,

r𝐛i=10log10RSRP𝐛i=r𝐛i0+n𝐛i,r_{\mathbf{b}_{i}}=10\log_{10}\text{RSRP}_{\mathbf{b}_{i}}=r_{\mathbf{b}_{i}}^{0}+n_{\mathbf{b}_{i}}, (7)

where r𝐛i0r_{\mathbf{b}_{i}}^{0} is the noise-free dB-domain RSRP value and n𝐛in_{\mathbf{b}_{i}} is a Gaussian perturbation with mean μb\mu_{b} and variance σb2\sigma_{b}^{2}. The detailed expressions of r𝐛i0r_{\mathbf{b}_{i}}^{0}, μb\mu_{b}, and σb2\sigma_{b}^{2} can be found in Eq. (7), Eq. (8a), and Eq. (8b) of [sim]. After sweeping all KK SSB beams, the UE obtains the RSRP fingerprint

𝐫𝐁=𝐫𝐁0+𝐧𝐁,\mathbf{r}_{\mathbf{B}}=\mathbf{r}_{\mathbf{B}}^{0}+\mathbf{n}_{\mathbf{B}}, (8)

where 𝐫𝐁0=[r𝐛10,,r𝐛K0]T\mathbf{r}_{\mathbf{B}}^{0}=[r_{\mathbf{b}_{1}}^{0},\ldots,r_{\mathbf{b}_{K}}^{0}]^{T} and 𝐧𝐁=[n𝐛1,,n𝐛K]T\mathbf{n}_{\mathbf{B}}=[n_{\mathbf{b}_{1}},\ldots,n_{\mathbf{b}_{K}}]^{T}. During initial access, the UE measures 𝐫𝐁\mathbf{r}_{\mathbf{B}} to determine the access beam and the associated random-access resources before subsequent CSI acquisition.
The above initial-access stage only provides coarse beam-level information for synchronization and access establishment, but it is generally insufficient for high-quality downlink beamforming. To support data transmission, the BS still needs a more refined UE-specific CSI representation, since the resulting beamforming performance critically depends on the quality of CSI acquisition. In practical limited-feedback systems, this requires additional CSI-reference signal (CSI-RS) transmission, UE-side channel estimation, CSI compression, and feedback. When the transmit-antenna dimension is large, such CSI acquisition can incur substantial training, feedback, and processing overhead. Therefore, the key issue is how efficiently this representation can be acquired under limited online resources.

III A Unified Framework for Limited-Feedback Beamforming

III-A CSI Acquisition and Feedback Pipeline

Recall the DL data transmission model in (4). If the DL CSI 𝐡\mathbf{h} is available at the BS, the single-user single-stream beamforming problem can be characterized as

𝐰=argmax𝐰:𝐰2=1|𝐡H𝐰|2,\mathbf{w}^{\star}=\arg\max_{\mathbf{w}:\|\mathbf{w}\|^{2}=1}|\mathbf{h}^{H}\mathbf{w}|^{2}, (9)

with the optimal solution to be the matched-filter or maximum-ratio-transmission (MRT) beamformer 𝐰=𝐡/𝐡\mathbf{w}^{\star}=\mathbf{h}/\|\mathbf{h}\|. In practice, however, the BS does not directly know the UE-specific DL channel. In TDD systems, uplink sounding and channel reciprocity may provide a useful estimate [Larsson2014MassiveMIMO], but calibration and refinement can still be required when reciprocity is imperfect or when the full-dimensional channel is not directly observable. In FDD systems, the DL CSI must instead be estimated at the UE and fed back to the BS.
We abstract this DL acquisition process as three coupled stages: 1) BS-side CSI-RS transmission, 2) UE-side CSI extraction and compression, and 3) BS-side reconstruction. Let 𝐒CSINc×Lc\mathbf{S}_{\text{CSI}}\in\mathbb{C}^{N_{c}\times L_{c}} denote the CSI-RS training matrix transmitted over LcL_{c} channel uses from NcN_{c} CSI-RS ports, which represent effective pilot transmission dimension seen by the UE, rather than necessarily a physical transmit antenna, and let 𝐂Nt×Nc\mathbf{C}\in\mathbb{C}^{N_{t}\times N_{c}} denote the CSI-RS precoder 222As for SSB, CSI-RS symbols over time-frequency resource elements are aggregated into a training matrix for analytical compactness.. The received CSI-RS is given by

𝐲CSIT(𝐂)=PCSI𝐡H𝐂𝐒CSI+𝐧CSIT,\mathbf{y}_{\text{CSI}}^{T}(\mathbf{C})=\sqrt{P_{\text{CSI}}}\,\mathbf{h}^{H}\mathbf{C}\mathbf{S}_{\text{CSI}}+\mathbf{n}_{\text{CSI}}^{T}, (10)

where PCSIP_{\text{CSI}} is the CSI-RS transmit power and 𝐧CSI𝒞𝒩(𝟎Lc,σn2𝐈Lc)\mathbf{n}_{\text{CSI}}\sim\mathcal{CN}(\mathbf{0}_{L_{c}},\sigma_{n}^{2}\mathbf{I}_{L_{c}}). Based on 𝐲CSI(𝐂)\mathbf{y}_{\text{CSI}}(\mathbf{C}), the UE obtains a CSI representation through an acquisition mapping \mathcal{R}, e.g., least-squares (LS) or minimum mean square error (MMSE) estimation [Hassibi2003Training]. In general, LcL_{c} should be no less than NcN_{c} to reliably recover 𝐡\mathbf{h}. Hence, the overhead for DL channel estimation is at least 𝒪(Nc2)\mathcal{O}(N_{c}^{2}). To avoid distraction from estimation error, we assume perfect acquisition under the required training dimension and write 𝐡=(𝐲CSI)\mathbf{h}=\mathcal{R}(\mathbf{y}_{\text{CSI}}) with :Lc×1Nt×1\mathcal{R}:\mathbb{C}^{L_{c}\times 1}\rightarrow\mathbb{C}^{N_{t}\times 1}.
Since feeding back full CSI is generally prohibitive, the UE compresses the acquired CSI into a lower-dimensional feedback message

𝐳=𝒬(𝐡),\mathbf{z}=\mathcal{Q}(\mathbf{h}), (11)

where 𝒬:Nt×1Nq×1\mathcal{Q}:\mathbb{C}^{N_{t}\times 1}\rightarrow\mathbb{C}^{N_{q}\times 1} represents the UE-side compression rule. The dimension NqN_{q} should be understood as a normalized feedback-payload proxy, e.g., the number of scalar coefficients or indices conveyed to the BS. Here, NqN_{q} is only adopted as an abstract channel-use-equivalent feedback overhead. The detailed modeling of the uplink feedback channel is beyond the scope of this paper. After receiving 𝐳\mathbf{z}, the BS reconstructs the channel as

𝐡^=(𝐳),\hat{\mathbf{h}}=\mathcal{F}(\mathbf{z}), (12)

where :Nq×1Nt×1\mathcal{F}:\mathbb{C}^{N_{q}\times 1}\rightarrow\mathbb{C}^{N_{t}\times 1} is the BS-side reconstruction mapping. In general, \mathcal{F} is not the inverse of 𝒬\mathcal{Q}, because the UE-side compression is lossy. As a consequence, the optimal MRT beamformer based on this reconstructed channel is given by

𝐰^=𝐡^𝐡^=(𝒬((𝐲CSI(𝐂))))(𝒬((𝐲CSI(𝐂)))).\hat{\mathbf{w}}=\frac{\hat{\mathbf{h}}}{\|\hat{\mathbf{h}}\|}=\frac{\mathcal{F}\big(\mathcal{Q}(\mathcal{R}(\mathbf{y}_{\text{CSI}}(\mathbf{C})))\big)}{\|\mathcal{F}\big(\mathcal{Q}(\mathcal{R}(\mathbf{y}_{\text{CSI}}(\mathbf{C})))\big)\|}. (13)

III-B Beamforming Design with Subspace Projection

The above pipeline makes the overhead-performance tradeoff explicit. Specifically, the online overhead consists of SSB-stage overhead and CSI acquisition/feedback overhead, which can be calculated as To=TSSB+TCSIT_{o}=T_{\rm SSB}+T_{\rm CSI} in channel uses. Here, TSSBT_{\rm SSB} accounts for SSB sweeping and, when needed, the associated RSRP reporting, while TCSIT_{\rm CSI} accounts for CSI-RS transmission and the subsequent CSI feedback payload. For a coherence block of TcT_{c} channel uses and denoting ρ=Pt/σn2\rho=P_{t}/\sigma_{n}^{2}, the limited-feedback beamforming design problem for maximizing the effective rate can be abstracted as

max𝐂,𝒬,(1ToTc)log2(1+ρ|𝐡H𝐰^|2).\max_{\mathbf{C},\,\mathcal{Q},\,\mathcal{F}}\;\left({1-\frac{{{T_{o}}}}{{{T_{c}}}}}\right){\log_{2}}\left({1+{\rho}{{\left|{{{\mathbf{h}}^{H}}\hat{\mathbf{w}}}\right|}^{2}}}\right). (14)

To streamline the beamforming design, we now introduce a subspace project theory of the mapping given in (13). In particular, 𝒬\mathcal{Q} and \mathcal{F} determine the UE-feedback information and the CSI reconstruction quality, which essentially inducing a scheme-specific CSI representation subspace 𝒰\mathcal{U} together. Let 𝐔\mathbf{U} be a basis matrix such that 𝒰=span(𝐔)\mathcal{U}=\mathrm{span}(\mathbf{U}). The orthogonal projector onto 𝒰\mathcal{U} is then given by 𝐏𝒰=𝐔(𝐔H𝐔)1𝐔H\mathbf{P}_{\mathcal{U}}=\mathbf{U}(\mathbf{U}^{H}\mathbf{U})^{-1}\mathbf{U}^{H}.

Lemma 1 (Subspace-projection characterization).

For any limited-feedback scheme, i.e., 𝒬\mathcal{Q} and \mathcal{F}, whose reconstructed CSI is restricted to a subspace 𝒰\mathcal{U}, the optimal MRT beamformer supported by this subspace is

𝐰^𝒰=𝐏𝒰𝐡𝐏𝒰𝐡,\hat{\mathbf{w}}_{\mathcal{U}}=\frac{\mathbf{P}_{\mathcal{U}}\mathbf{h}}{\|\mathbf{P}_{\mathcal{U}}\mathbf{h}\|}, (15)

where 𝐏𝒰𝐡𝟎\mathbf{P}_{\mathcal{U}}\mathbf{h}\neq\mathbf{0}. The resulting beamforming gain is 𝐏𝒰𝐡2\|\mathbf{P}_{\mathcal{U}}\mathbf{h}\|^{2}. Under Assumption 1, it can be approximated as

𝐏𝒰𝐡2l=1L|αl|2gl,\|\mathbf{P}_{\mathcal{U}}\mathbf{h}\|^{2}\approx\sum_{l=1}^{L}|\alpha_{l}|^{2}g_{l}, (16)

where gl𝐏𝒰𝐚(ul)2[0,1]g_{l}\triangleq\|\mathbf{P}_{\mathcal{U}}\mathbf{a}(u_{l})\|^{2}\in[0,1] is the projected channel gain of the ll-th path.

Proof:

For a beamformer restricted to 𝒰\mathcal{U}, the MRT direction is the normalized projection of 𝐡\mathbf{h} onto 𝒰\mathcal{U}, which yields 𝐰^𝒰=𝐏𝒰𝐡/𝐏𝒰𝐡\hat{\mathbf{w}}_{\mathcal{U}}=\mathbf{P}_{\mathcal{U}}\mathbf{h}/\|\mathbf{P}_{\mathcal{U}}\mathbf{h}\|. Thus,

|𝐡H𝐰^𝒰|2\displaystyle|\mathbf{h}^{H}\hat{\mathbf{w}}_{\mathcal{U}}|^{2} =|𝐡H𝐏𝒰𝐡𝐏𝒰𝐡|2=𝐏𝒰𝐡2\displaystyle=\left|\frac{\mathbf{h}^{H}\mathbf{P}_{\mathcal{U}}\mathbf{h}}{\|\mathbf{P}_{\mathcal{U}}\mathbf{h}\|}\right|^{2}=\|\mathbf{P}_{\mathcal{U}}\mathbf{h}\|^{2} (17)
=𝜶H𝐀H𝐏𝒰𝐀𝜶.\displaystyle=\bm{\alpha}^{H}\mathbf{A}^{H}\mathbf{P}_{\mathcal{U}}\mathbf{A}\bm{\alpha}. (18)

Under Assumption 1, the cross-path correlations are weak and the off-diagonal terms in 𝐀H𝐏𝒰𝐀\mathbf{A}^{H}\mathbf{P}_{\mathcal{U}}\mathbf{A} are negligible, giving 𝐏𝒰𝐡2l=1L|αl|2𝐏𝒰𝐚(ul)2\|\mathbf{P}_{\mathcal{U}}\mathbf{h}\|^{2}\approx\sum_{l=1}^{L}|\alpha_{l}|^{2}\|\mathbf{P}_{\mathcal{U}}\mathbf{a}(u_{l})\|^{2}. Dividing this expression by 𝐡2l=1L|αl|2\|\mathbf{h}\|^{2}\approx\sum_{l=1}^{L}|\alpha_{l}|^{2} completes the proof. ∎

Theorem 1 (CSI-capture efficiency).

Under Lemma 1 and Assumption 1, the CSI-capture efficiency satisfies

η𝐏𝒰𝐡2𝐡2l=1L|αl|2gll=1L|αl|2,\eta\triangleq\frac{\|\mathbf{P}_{\mathcal{U}}\mathbf{h}\|^{2}}{\|\mathbf{h}\|^{2}}\approx\frac{\sum_{l=1}^{L}|\alpha_{l}|^{2}g_{l}}{\sum_{l=1}^{L}|\alpha_{l}|^{2}}, (19)

where gl𝐏𝒰𝐚(ul)2[0,1]g_{l}\triangleq\|\mathbf{P}_{\mathcal{U}}\mathbf{a}(u_{l})\|^{2}\in[0,1] denotes the projected gain of the ll-th path.

Proof:

The result follows directly from Lemma 1. ∎

Lemma 1 and Theorem 1 show that the performance of a limited-feedback scheme is governed by how efficiently its induced representation subspace captures channel energy. The path-capture factor glg_{l} measures the fraction of the ll-th path energy preserved by the scheme, and the overall CSI-capture efficiency is a path-power-weighted average of these factors. Therefore, problem (14) can be approximately rewritten as

max𝐂,𝒬,(1ToTc)log2(1+ρl=1L|αl|2gl).\max_{\mathbf{C},\,\mathcal{Q},\,\mathcal{F}}\;\left({1-\frac{{{T_{o}}}}{{{T_{c}}}}}\right){\log_{2}}\left(1+\rho\sum_{l=1}^{L}|\alpha_{l}|^{2}g_{l}\right). (20)

Problem (14) and (20) are both difficult to solve directly because the CSI acquisition, compression, and reconstruction mappings are coupled. Nevertheless, the subspace-projection viewpoint reveals a useful design principle: a limited-feedback scheme should preserve as much dominant channel energy as possible while minimizing CSI-RS and feedback overhead. The following sections use this viewpoint to reinterpret standardized mechanisms and to motivate the proposed SSI-enhanced design.

IV Conventional Limited-Feedback Schemes

Current 3GPP NR standards do not solve problem (14) in a direct optimization sense. Instead, they approximate this design objective through standardized limited-feedback schemes, including Type-I, Type-II, and PSC [3GPP38214, Fu2023TutorialCodebooks]. Within the unified framework developed above, these methods define practical implementations of the UE-side compression mapping 𝒬\mathcal{Q} and the BS-side reconstruction mapping \mathcal{F} under explicit signaling and complexity constraints. Their main differences lie in the representation granularity of CSI, the resulting feedback payload, and hence, the beamforming overhead-performance tradeoff. In this section, we reinterpret these mechanisms through the unified optimization viewpoint established in Section III.

IV-A Type-I Feedback

The Type-I feedback serves as the simplest and lightest-weight limited-feedback beamforming mechanism in current NR systems, offering low implementation complexity and feedback overhead at the cost of relatively low spectral efficiency.

IV-A1 Characterization using the Proposed Framework

Within the unified formulation in problem (14), the Type-I feedback can be viewed as an extremely coarse CSI-quantization scheme. Concretely, since the BS does not know the UE-specific dominant directions a priori, whereas the UE requires sufficiently comprehensive channel observations to perform reliable codebook-based beam selection, non-precoded CSI-RS, i.e., 𝐂I=𝐈Nt\mathbf{C}_{\rm I}=\mathbf{I}_{N_{t}}, is first adopted to probe all effective transmit ports in a direction-agnostic manner 333Typically, each port is linked to multiple physical antennas, allowing for enhanced signal quality and increased capacity in the communication system. However, we assume the number of CSI-RS ports equals the number of transmit antennas here for analytical simplicity..
After receiving CSI-RS, the UE obtains the perfect DL CSI 𝐡\mathbf{h} as assumed previously. To reduce CSI feedback overhead, the BS and UE share a predefined structured codebook 𝐃\mathbf{D} to quantize the channel, which is typically an oversampled discrete Fourier transform (DFT) codebook. For a ULA with NtN_{t} antennas, the quantization codebook is of dimension Nt×NtODN_{t}\times N_{t}O_{D} with oversampling factor ODO_{D}. The mm-th column of 𝐃\mathbf{D} can be constructed as

𝐝m=1Nt[1,ej2πmNtOD,,ej2πm(Nt1)NtOD]T,\mathbf{d}_{m}=\frac{1}{\sqrt{N_{t}}}\left[1,e^{j2\pi\frac{m}{N_{t}O_{D}}},\ldots,e^{j2\pi\frac{m(N_{t}-1)}{N_{t}O_{D}}}\right]^{T}, (21)

where m=0,1,,NtOD1m=0,1,\cdots,N_{t}O_{D}-1. In the Type-I feedback, the UE selects the beam that best matches the estimated channel from 𝐃\mathbf{D}. The corresponding index is then fed back as the precoding matrix indicator (PMI) to the BS 444Other CSI quantities such as the rank indicator and channel quality indicator are also fed back in this process to support multi-layer transmission and modulation-and-coding selection. However, these quantities are beyond the scope of this paper and are not considered further.. In this manner, the UE-side CSI compression mapping can be expressed as

zI=𝒬I(𝐡)=argmaxm=1,,NtOD|𝐡H𝐝m|2.z_{\rm I}=\mathcal{Q}_{\rm I}(\mathbf{h})=\operatorname*{arg\,max}_{m=1,\cdots,N_{t}O_{D}}\left|\mathbf{h}^{H}\mathbf{d}_{m}\right|^{2}. (22)

Note that quantization of zIz_{\rm I} in the transmission process is not considered in this paper.
After receiving zIz_{\rm I}, the BS reconstructs the CSI from the PMI using the predefined oversampled DFT codebook 𝐃\mathbf{D} as

𝐡^I=I(zI)=𝐝zI.\hat{\mathbf{h}}_{\rm I}=\mathcal{F}_{\rm I}(z_{\rm I})=\mathbf{d}_{z_{\rm I}}. (23)

Type-I feedback then defines a one-dimensional subspace 𝒰I=span(𝐝zI)\mathcal{U}_{\rm I}=\mathrm{span}(\mathbf{d}_{z_{\rm I}}) with projector 𝐏I=𝐝zI𝐝zIH\mathbf{P}_{\rm I}=\mathbf{d}_{z_{\rm I}}\mathbf{d}_{z_{\rm I}}^{H}. As a consequence, the optimal Type-I beamforming vector is given by

𝐰^I=I(𝒬I((𝐲CSI(𝐈Nt))))I(𝒬I((𝐲CSI(𝐈Nt))))=𝐝zI,\hat{\mathbf{w}}_{\rm I}=\frac{\mathcal{F}_{\rm I}\big(\mathcal{Q}_{\rm I}(\mathcal{R}(\mathbf{y}_{\text{CSI}}(\mathbf{I}_{N_{t}})))\big)}{\|\mathcal{F}_{\rm I}\big(\mathcal{Q}_{\rm I}(\mathcal{R}(\mathbf{y}_{\text{CSI}}(\mathbf{I}_{N_{t}})))\big)\|}=\mathbf{d}_{z_{\rm I}}, (24)

This expression shows that Type-I scheme realizes limited-feedback beamforming by quantizing the spatial channel with a DFT basis and selecting a single direction that best matches the channel. Hence, the Type-I scheme primarily beams toward one dominant direction on the predefined grid. This makes Type-I feedback a low-complexity and low-overhead feasible realization of (14), but its achievable rate is constrained by the codebook resolution and the rank-one representation. The resulting beamforming loss can be pronounced when the actual channel direction is mismatched with the predefined grid, or when the channel energy is spread over multiple propagation paths. Type-I feedback therefore serves as a baseline beam-management mechanism in current NR systems and is particularly suitable for scenarios with extremely limited feedback capability.

IV-A2 CSI-Capture Efficiency Analysis

The Type-I feedback is equivalent to choosing the projector that captures the largest channel energy. Accordingly, the achievable rate for Type-I feedback is given by

RI\displaystyle R_{\rm I} =log2(1+ρ|𝐡H𝐰^I|2)=log2(1+ρ𝐏I𝐡2)\displaystyle=\log_{2}\left(1+\rho\left|\mathbf{h}^{H}\hat{\mathbf{w}}_{\rm I}\right|^{2}\right)=\log_{2}\left(1+\rho\|\mathbf{P}_{\rm I}\mathbf{h}\|^{2}\right)
=log2(1+ρ|𝐡H𝐝zI|2)=log2(1+ρ|𝜶H𝐀H𝐝zI|2)\displaystyle={\log_{2}}\left({1+\rho{{\left|{{{\bf{h}}^{H}}{\bf{d}}_{z_{\rm I}}}\right|}^{2}}}\right)={\log_{2}}\left({1+\rho{{\left|{{\bm{\alpha}^{H}}{{\bf{A}}^{H}}{\bf{d}}_{z_{\rm I}}}\right|}^{2}}}\right)
(a)log2(1+ρ|αlI|2|𝐚H(ulI)𝐝zI|2),\displaystyle\overset{(a)}{\approx}{\log_{2}}\left({1+\rho{{\left|\alpha_{l_{\rm I}}\right|}^{2}}{{\left|{{{\bf{a}}^{H}}\left({u}_{l_{\rm I}}\right){\bf{d}}_{z_{\rm I}}}\right|}^{2}}}\right), (25)

where lIargmaxl|αl|2l_{\rm I}\triangleq\arg\max_{l}|\alpha_{l}|^{2} denotes the strongest path. The approximation in step (a)(a) is based on Assumption 1. More particularly, for Type-I feedback, it has gl0,llIg_{l}\approx 0,\forall l\neq l_{\rm I} in (20) and

glI\displaystyle g_{l_{\rm I}} =|𝐚H(ulI)𝐝zI|2=|1Ntn=0Nt1ej2πn(ulIzINtOD)|2\displaystyle={{\left|{{{\bf{a}}^{H}}\left({u}_{l_{\rm I}}\right){\bf{d}}_{z_{\rm I}}}\right|}^{2}}={\left|\frac{1}{N_{t}}\sum_{n=0}^{N_{t}-1}e^{j2\pi n\left(u_{l_{\rm I}}-\frac{{z_{\rm I}}}{N_{t}O_{D}}\right)}\right|}^{2}
=1Nt2|sin(πNtΔl,m)sin(πΔl,m)|2,\displaystyle=\frac{1}{N_{t}^{2}}\left|\frac{\sin\!\left(\pi N_{t}\Delta_{l,m}\right)}{\sin\!\left(\pi\Delta_{l,m}\right)}\right|^{2}, (26)

where ΔlI,zI=ulIzINtOD\Delta_{{l_{\rm I}},{z_{\rm I}}}=u_{l_{\rm I}}-\frac{{z_{\rm I}}}{N_{t}O_{D}}. For the oversampled DFT codebook, we have |ΔlI,zI|12NtOD|\Delta_{{l_{\rm I}},{z_{\rm I}}}|\leq\frac{1}{2N_{t}O_{D}}, which yields glI[gmin,1]g_{l_{\rm I}}\in[g_{\min},1] with

gmin=1Nt2|sin(π2OD)sin(π2NtOD)|2.g_{\min}=\frac{1}{N_{t}^{2}}\left|\frac{\sin\!\left(\frac{\pi}{2O_{D}}\right)}{\sin\!\left(\frac{\pi}{2N_{t}O_{D}}\right)}\right|^{2}. (27)

Overall, the spectral efficiency of Type-I feedback can be approximated as

RIlog2(1+ρ|αlI|2glI).R_{\rm I}\approx{\log_{2}}\left({1+\rho{{\left|\alpha_{l_{\rm I}}\right|}^{2}}g_{l_{\rm I}}}\right). (28)

Based on the above analysis and the results in Theorem 1, the CSI-capture efficiency for Type-I feedback method is given in the following corollary.

Corollary 1 (Type-I CSI-capture efficiency).

Under Assumption 1, the CSI-capture efficiency of Type-I feedback can be approximated as

ηI=𝐏I𝐡2𝐡2|αlI|2glIl=1L|αl|2.\eta_{\rm I}=\frac{\|\mathbf{P}_{\rm I}\mathbf{h}\|^{2}}{\|\mathbf{h}\|^{2}}\approx\frac{|\alpha_{l_{\rm I}}|^{2}g_{l_{\rm I}}}{\sum_{l=1}^{L}|\alpha_{l}|^{2}}. (29)

Corollary 1 shows that the loss of Type-I feedback is caused by two factors: the energy outside the single selected dominant path and the DFT-grid mismatch captured by glIg_{l_{\rm I}}. The first factor limits Type-I feedback in multi-path channels, while the second motivates richer or learned representation bases beyond single-beam quantization.

IV-B Type-II Feedback

To overcome the coarse spatial quantization of Type-I feedback, Type-II scheme provides a more flexible and higher-resolution beam-domain representation of the channel.

IV-B1 Characterization using the Proposed Framework

As in Type-I feedback, non-precoded CSI-RS is still adopted so that the UE can probe the full-port DL CSI space, i.e., 𝐂II=𝐈Nt\mathbf{C}_{\rm II}=\mathbf{I}_{N_{t}}. After acquiring the perfect DL CSI, the UE no longer selects only a single beam. Instead, it reports a small set of beams from a predefined oversampled DFT dictionary together with their combining coefficients. Consequently, Type-II feedback retains a CSI-RS overhead on the same order as Type-I, while incurring a higher feedback cost because both beam indices and coefficients must be conveyed.
Concretely, the UE selects QQ dominant spatial directions from the oversampled DFT codebook 𝐃\mathbf{D} based on the DL CSI. Supposing 𝒮{0,,NtOD1}\mathcal{S}\subseteq\{0,\ldots,N_{t}O_{D}-1\} and |𝒮|=Q|\mathcal{S}|=Q, the corresponding beam subspace is constructed as 𝒰𝒮=span(𝐃𝒮)=span([𝐝m]m𝒮)\mathcal{U}_{\mathcal{S}}=\mathrm{span}(\mathbf{D}_{\mathcal{S}})=\mathrm{span}([\mathbf{d}_{m}]_{m\in\mathcal{S}}). Therefore, the UE determines the optimal beam indices by solving

𝒮II=argmax𝒮𝐏𝒰𝒮𝐡2,\mathcal{S}_{\rm II}=\arg\max_{\mathcal{S}}\left\|\mathbf{P}_{\mathcal{U}_{\mathcal{S}}}\mathbf{h}\right\|^{2}, (30)

where 𝐏𝒰𝒮=𝐃𝒮(𝐃𝒮H𝐃𝒮)1𝐃𝒮H\mathbf{P}_{\mathcal{U}_{\mathcal{S}}}=\mathbf{D}_{\mathcal{S}}(\mathbf{D}_{\mathcal{S}}^{H}\mathbf{D}_{\mathcal{S}})^{-1}\mathbf{D}_{\mathcal{S}}^{H} is the orthogonal projector onto the subspace spanned by the selected beams. The CSI representation subspace defined by Type-II feedback is therefore 𝒰II=span(𝐃𝒮II)\mathcal{U}_{\rm II}=\mathrm{span}(\mathbf{D}_{\mathcal{S}_{\rm II}}). The UE then estimates the combining coefficients by minimizing the compression error

𝜶II=argmin𝜶𝐡𝐃𝒮II𝜶2=𝐃𝒮II𝐡.\bm{\alpha}_{\rm II}=\arg\min_{\bm{\alpha}}\|\mathbf{h}-\mathbf{D}_{\mathcal{S}_{\rm II}}\bm{\alpha}\|^{2}=\mathbf{D}_{\mathcal{S}_{\rm II}}^{\dagger}\mathbf{h}. (31)

Therefore, the UE-side compression mapping can be expressed as

𝐳II=𝒬II(𝐡)=(𝒮II,𝜶II).\mathbf{z}_{\rm II}=\mathcal{Q}_{\rm II}(\mathbf{h})=\left(\mathcal{S}_{\rm II},\bm{\alpha}_{\rm II}\right). (32)

More particularly, the UE reports both the selected beam indices and the corresponding combining coefficients to the BS. After receiving 𝐳II\mathbf{z}_{\rm II}, the BS reconstructs the DL CSI via

𝐡^II=II(𝐳II)=𝐃𝒮II𝜶II=𝐃𝒮II𝐃𝒮II𝐡=𝐏II𝐡,\hat{\mathbf{h}}_{\rm II}=\mathcal{F}_{\rm II}(\mathbf{z}_{\rm II})=\mathbf{D}_{\mathcal{S}_{\rm II}}\bm{\alpha}_{\rm II}=\mathbf{D}_{\mathcal{S}_{\rm II}}\mathbf{D}_{\mathcal{S}_{\rm II}}^{\dagger}\mathbf{h}=\mathbf{P}_{\rm II}\mathbf{h}, (33)

which is the projection of the actual channel onto the subspace spanned by the selected directions. The resulting Type-II beamformer is

𝐰^II=II(𝒬II((𝐲CSI(𝐈Nt))))II(𝒬II((𝐲CSI(𝐈Nt))))=𝐏II𝐡𝐏II𝐡,\hat{\mathbf{w}}_{\rm II}=\frac{\mathcal{F}_{\rm II}\big(\mathcal{Q}_{\rm II}(\mathcal{R}(\mathbf{y}_{\text{CSI}}(\mathbf{I}_{N_{t}})))\big)}{\|\mathcal{F}_{\rm II}\big(\mathcal{Q}_{\rm II}(\mathcal{R}(\mathbf{y}_{\text{CSI}}(\mathbf{I}_{N_{t}})))\big)\|}=\frac{\mathbf{P}_{\rm II}\mathbf{h}}{\|\mathbf{P}_{\rm II}\mathbf{h}\|}, (34)

Compared with Type-I feedback, the feasible beamforming set is no longer restricted to a single codeword but to the subspace spanned by a set of selected beams. This provides greater representation flexibility and usually improves the achievable beamforming gain in multipath channels. However, the UE must determine both the dominant beam directions and the corresponding coefficients based on the estimated high-dimensional channel. Both beam selection and Moore–Penrose pseudoinverse calculation impose a high computational burden on the UE and require a larger feedback payload.

IV-B2 CSI-Capture Efficiency Analysis

The achievable rate for Type-II feedback is given by

RII\displaystyle R_{\rm II} =log2(1+ρ|𝐡H𝐰^II|2)=log2(1+ρ|𝐡H𝐏II𝐡𝐏II𝐡|2)\displaystyle=\log_{2}\left(1+\rho\left|\mathbf{h}^{H}\hat{\mathbf{w}}_{\rm II}\right|^{2}\right)=\log_{2}\left(1+\rho\left|\frac{\mathbf{h}^{H}\mathbf{P}_{\rm II}\mathbf{h}}{\|\mathbf{P}_{\rm II}\mathbf{h}\|}\right|^{2}\right)
=log2(1+ρ𝐏II𝐡2)=log2(1+ρ𝜶H𝐀H𝐏II𝐀𝜶)\displaystyle=\log_{2}\left(1+\rho\|\mathbf{P}_{\rm II}\mathbf{h}\|^{2}\right)=\log_{2}\left(1+\rho\,\bm{\alpha}^{H}\mathbf{A}^{H}\mathbf{P}_{\rm II}\mathbf{A}\bm{\alpha}\right)
(b)log2(1+ρlQ,II|αl|2gl,II),\displaystyle\overset{(b)}{\approx}\log_{2}\left(1+\rho\sum_{l\in\mathcal{L}_{Q,\rm II}}|\alpha_{l}|^{2}g_{l,\rm II}\right), (35)

where Q,II\mathcal{L}_{Q,\rm II} denotes the set of dominant paths captured by the selected Type-II subspace. The approximation in (b) is again based on Assumption 1. Thus, the path-capture factor satisfies gl,II0g_{l,\rm II}\approx 0 for lQ,IIl\notin\mathcal{L}_{Q,\rm II}, whereas for captured paths it is given by

gl,II\displaystyle g_{l,\rm II} 𝐏II𝐚(ul)2m𝒮II|𝐚H(ul)𝐝m|2\displaystyle\triangleq\|\mathbf{P}_{\rm II}\mathbf{a}(u_{l})\|^{2}\approx\sum_{m\in\mathcal{S}_{\rm II}}\left|\mathbf{a}^{H}(u_{l})\mathbf{d}_{m}\right|^{2}
=m𝒮II1Nt2|sin(πNtΔl,m)sin(πΔl,m)|2.\displaystyle=\sum_{m\in\mathcal{S}_{\rm II}}\frac{1}{N_{t}^{2}}\left|\frac{\sin\!\left(\pi N_{t}\Delta_{l,m}\right)}{\sin\!\left(\pi\Delta_{l,m}\right)}\right|^{2}. (36)

For additional intuition, we further have

gl,II|𝐚H(ul)𝐝ml|2=glI,g_{l,\rm II}\gtrsim\left|\mathbf{a}^{H}(u_{l})\mathbf{d}_{m_{l}^{\star}}\right|^{2}=g_{l_{\rm I}}, (37)

where ml=argminm𝒮II|ulmNtOD|m_{l}^{\star}=\arg\min_{m\in\mathcal{S}_{\rm II}}\left|u_{l}-\frac{m}{N_{t}O_{D}}\right| denotes the selected DFT atom closest to the ll-th path. The resulting CSI-capture efficiency of the Type-II feedback is given below.

Corollary 2 (Type-II CSI-capture efficiency).

Under Assumption 1, the CSI-capture efficiency of Type-II feedback can be approximated as

ηII=𝐏II𝐡2𝐡2lQ,II|αl|2gl,IIl=1L|αl|2.\eta_{\rm II}=\frac{\|\mathbf{P}_{\rm II}\mathbf{h}\|^{2}}{\|\mathbf{h}\|^{2}}\approx\frac{\sum_{l\in\mathcal{L}_{Q,\rm II}}|\alpha_{l}|^{2}g_{l,\rm II}}{\sum_{l=1}^{L}|\alpha_{l}|^{2}}. (38)

Corollary 2 shows that Type-II feedback improves over single-beam quantization by allowing several selected dictionary beams to jointly capture multipath energy. Its remaining loss comes from the energy outside the selected subspace and from DFT-basis mismatch, while its cost is the UE-side search and coefficient-estimation burden required to construct this subspace from full-dimensional CSI.

IV-C Port-Selection Feedback

To reduce the high feedback overhead and UE-side computational complexity of the flexible Type-II feedback while retaining a richer channel representation than the single-beam Type-I feedback, the PSC was introduced as a more structured limited-feedback mechanism. Instead of representing the channel through a large beam-domain dictionary, PSC restricts the feedback to a selected subset of effective ports and the associated low-dimensional coefficients.

IV-C1 Characterization using the Proposed Framework

As in Type-I and Type-II feedback, PSC still relies on non-precoded CSI-RS to expose the full CSI-RS port space to the UE, i.e., 𝐂PSC=𝐈Nt\mathbf{C}_{\rm PSC}=\mathbf{I}_{N_{t}}. After acquiring the full-port CSI 𝐡\mathbf{h}, the UE selects a subset of NpN_{p} effective ports that preserve as much channel energy as possible. Let 𝒮{1,,Nt},|𝒮|=Np\mathcal{S}\subseteq\{1,\ldots,N_{t}\},|\mathcal{S}|=N_{p} denote the selected effective-port set, and let

𝐄𝒮=[𝐞p]p𝒮Nt×Np\mathbf{E}_{\mathcal{S}}=[\mathbf{e}_{p}]_{p\in\mathcal{S}}\in\mathbb{C}^{N_{t}\times N_{p}} (39)

be the associated selection matrix, where 𝐞p\mathbf{e}_{p} is the pp-th canonical basis vector. The UE selects the effective-port subset that best captures the channel energy, which can be formulated as

𝒮PSC=argmax𝒮𝐏𝒮𝐡2,\mathcal{S}_{\rm PSC}=\arg\max_{\mathcal{S}}\left\|\mathbf{P}_{\mathcal{S}}\mathbf{h}\right\|^{2}, (40)

where 𝐏𝒮=𝐄𝒮(𝐄𝒮H𝐄𝒮)1𝐄𝒮H=𝐄𝒮𝐄𝒮H\mathbf{P}_{\mathcal{S}}=\mathbf{E}_{\mathcal{S}}(\mathbf{E}_{\mathcal{S}}^{H}\mathbf{E}_{\mathcal{S}})^{-1}\mathbf{E}_{\mathcal{S}}^{H}=\mathbf{E}_{\mathcal{S}}\mathbf{E}_{\mathcal{S}}^{H} is the orthogonal projector onto the port-selected subspace. This is consistent with selecting the strongest NpN_{p} effective ports. The CSI representation subspace defined by PSC is therefore 𝒰PSC=span(𝐄𝒮PSC)\mathcal{U}_{\rm PSC}=\mathrm{span}(\mathbf{E}_{\mathcal{S}_{\rm PSC}}).
The UE then estimates the low-dimensional coefficient vector over the selected subspace by solving the same LS problem as in Type-II feedback, which yields

𝜶PSC=argmin𝜶𝐡𝐄PSC𝜶2=𝐄PSC𝐡=𝐄PSCH𝐡.\bm{\alpha}_{\rm PSC}=\arg\min_{\bm{\alpha}}\|\mathbf{h}-\mathbf{E}_{\rm PSC}\bm{\alpha}\|^{2}=\mathbf{E}_{\rm PSC}^{\dagger}\mathbf{h}=\mathbf{E}_{\rm PSC}^{H}\mathbf{h}. (41)

A key distinction from Type-II feedback is that the coefficients are obtained by selecting entries from the estimated CSI rather than by performing Moore–Penrose pseudoinverse calculation over a beam-domain dictionary, which significantly reduces the UE-side computational complexity. Then, the UE reports

𝐳PSC=𝒬PSC(𝐡)=(𝒮PSC,𝜶PSC).\mathbf{z}_{\rm PSC}=\mathcal{Q}_{\rm PSC}(\mathbf{h})=\left(\mathcal{S}_{\rm PSC},\bm{\alpha}_{\rm PSC}\right). (42)

Correspondingly, the BS recovers the CSI as

𝐡^PSC=PSC(𝐳PSC)=𝐄PSC𝜶PSC=𝐏PSC𝐡,\hat{\mathbf{h}}_{\rm PSC}=\mathcal{F}_{\rm PSC}(\mathbf{z}_{\rm PSC})=\mathbf{E}_{\rm PSC}\bm{\alpha}_{\rm PSC}=\mathbf{P}_{\rm PSC}\mathbf{h}, (43)

which leads to the beamformer

𝐰^PSC=PSC(𝒬PSC((𝐲CSI(𝐈Nt))))PSC(𝒬PSC((𝐲CSI(𝐈Nt))))=𝐏PSC𝐡𝐏PSC𝐡.\hat{\mathbf{w}}_{\rm PSC}=\frac{\mathcal{F}_{\rm PSC}\big(\mathcal{Q}_{\rm PSC}(\mathcal{R}(\mathbf{y}_{\text{CSI}}(\mathbf{I}_{N_{t}})))\big)}{\|\mathcal{F}_{\rm PSC}\big(\mathcal{Q}_{\rm PSC}(\mathcal{R}(\mathbf{y}_{\text{CSI}}(\mathbf{I}_{N_{t}})))\big)\|}=\frac{\mathbf{P}_{\rm PSC}\mathbf{h}}{\|\mathbf{P}_{\rm PSC}\mathbf{h}\|}. (44)

Under the unified framework, PSC offers a more structured representation, a smaller search space, and lower implementation complexity than Type-II feedback.

IV-C2 CSI-Capture Efficiency Analysis

Let |h~|(1)2|h~|(Nc)2|\tilde{h}|_{(1)}^{2}\geq\cdots\geq|\tilde{h}|_{(N_{c})}^{2} denote the sorted effective-port powers. The PSC projector captures the energy of the selected effective ports. Therefore, before accounting for CSI-RS and feedback overhead, the instantaneous rate associated with PSC beamforming is

RPSC\displaystyle R_{\rm PSC} =log2(1+ρ|𝐡H𝐰^PSC|2)=log2(1+ρ𝐏PSC𝐡2)\displaystyle=\log_{2}\left(1+\rho\left|\mathbf{h}^{H}\hat{\mathbf{w}}_{\rm PSC}\right|^{2}\right)=\log_{2}\left(1+\rho\|\mathbf{P}_{\rm PSC}\mathbf{h}\|^{2}\right)
=log2(1+ρi=1Np|h~|(i)2).\displaystyle=\log_{2}\left(1+\rho\sum_{i=1}^{N_{p}}|\tilde{h}|_{(i)}^{2}\right). (45)
Corollary 3 (PSC CSI-capture efficiency).

For a selected effective-port set of size NpN_{p}, the CSI-capture efficiency of PSC is

ηPSC=𝐏PSC𝐡2𝐡2=i=1Np|h~|(i)2𝐡2.\eta_{\rm PSC}=\frac{\|\mathbf{P}_{\rm PSC}\mathbf{h}\|^{2}}{\|\mathbf{h}\|^{2}}=\frac{\sum_{i=1}^{N_{p}}|\tilde{h}|_{(i)}^{2}}{\|\mathbf{h}\|^{2}}. (46)

Corollary 3 indicates that PSC is governed by the concentration of the effective-port-domain channel. It can be efficient when a small number of ports dominate, but it may lose substantial energy when the effective-port powers are spread more uniformly. Unlike a global ordering among Type-I, Type-II, and PSC, this interpretation is explicitly channel- and representation-dependent.

V Site-Specific Type-II Feedback

The above subspace-projection view also clarifies the main limitation of conventional feedback mechanisms. To achieve high-quality beamforming, especially in Type-II feedback, the UE must perform several costly online tasks, including high-dimensional CSI acquisition, dominant-subspace identification, and coefficient calculation within the selected subspace. Consequently, improving the subspace representation capability usually comes at the cost of substantially higher CSI acquisition, feedback, and UE-side computation overhead.
To overcome this limitation, we propose a site-specific Type-II feedback scheme with two components: BS-side SSI-conditioned subspace inference and UE-side low-dimensional coefficient feedback. In particular, the BS first infers a UE-dependent dominant beam subspace from site-specific knowledge, and the UE then only refines the low-dimensional coefficients within that subspace. In this way, the proposed scheme reduces UE-side overhead while preserving the rich subspace representation capability of Type-II feedback.

V-A Site-Specific Subspace Inference

Compared with conventional Type-II feedback, the key idea of the proposed feedback scheme is to infer the dominant transmit subspace at the BS before explicit UE-side CSI estimation and feedback. To make this possible, the BS must rely on a lightweight channel-dependent observation that is available before the CSI-RS stage. The RSRP vector 𝐫𝐁\mathbf{r}_{\mathbf{B}} acquired during SSB probing serves this purpose.
Specifically, the RSRP vector captures how the channel energy is distributed over a set of coarse probing directions. Although this RSRP vector is insufficient to recover the full instantaneous CSI because phase information is unavailable, it provides a beam-domain power fingerprint that is strongly related to the dominant angular support of the channel [Heng2022SiteSpecificProbing, Ning2023RSRPCodebook, sim]. In site-specific environments with relatively stable propagation geometry and a limited number of dominant scattering clusters, similar dominant propagation directions tend to induce similar RSRP patterns. Accordingly, the RSRP vector provides a suitable low-overhead input for BS-side prediction of a candidate dominant beam subspace.
To this end, we suppose that the BS infers a UE-dependent low-dimensional channel subspace through a mapping

𝐂p=Ψ(𝐫𝐁)=[𝐜p,1,,𝐜p,Q]Nt×Q,\mathbf{C}_{p}=\Psi(\mathbf{r}_{\mathbf{B}})=[\mathbf{c}_{p,1},\cdots,\mathbf{c}_{p,Q}]\in\mathbb{C}^{N_{t}\times Q}, (47)

where QNtQ\ll N_{t} and the columns of 𝐂p\mathbf{C}_{p} are expected to align with the dominant propagation directions of the current UE. Without loss of generality, we assume that 𝐂p\mathbf{C}_{p} is an orthonormal basis of the inferred subspace, since any linearly independent basis can be transformed into an equivalent orthonormal basis via QR decomposition without changing its span.
In this way, the dominant transmit directions are no longer searched exhaustively at the UE as in conventional Type-II, but are instead inferred at the BS by conditioning the offline learned SSI on the RSRP fingerprint. The remaining online feedback task is therefore reduced to refining the instantaneous low-dimensional coefficients over the inferred subspace.

V-B Low-Dimensional Coefficient Feedback

After the BS determines 𝐂p\mathbf{C}_{p}, it transmits beamformed CSI-RS only over this inferred subspace. The received CSI-RS at the UE is then

𝐲CSIT(𝐂p)=PCSI𝐡H𝐂p𝐒CSI+𝐧CSIT,\mathbf{y}_{\text{CSI}}^{T}(\mathbf{C}_{p})=\sqrt{P_{\text{CSI}}}\mathbf{h}^{H}\mathbf{C}_{p}\mathbf{S}_{\text{CSI}}+\mathbf{n}_{\text{CSI}}^{T}, (48)

where 𝐒CSIQ×Lc\mathbf{S}_{\text{CSI}}\in\mathbb{C}^{Q\times L_{c}}. Compared with the conventional limited-feedback schemes, the CSI-RS dimension is reduced from NtN_{t} to QQ, which significantly reduces the CSI acquisition overhead from 𝒪(Nt2)\mathcal{O}(N_{t}^{2}) to 𝒪(Q2)\mathcal{O}(Q^{2}). In this case, the UE observes the effective channel over the inferred subspace

𝐡p=𝐂pH𝐡.\mathbf{h}_{p}=\mathbf{C}_{p}^{H}\mathbf{h}. (49)

We still assume that the UE can perfectly recover 𝐡p\mathbf{h}_{p} from the received CSI-RS, i.e., 𝐡p=p(𝐲CSI(𝐂p))\mathbf{h}_{p}=\mathcal{R}_{p}\big(\mathbf{y}_{\text{CSI}}(\mathbf{C}_{p})\big).
Therefore, unlike conventional Type-II feedback, the UE no longer needs to either determine the dominant beam directions or estimate the coefficients by solving an optimization problem from the full-dimensional channel estimate. Instead, it can directly report the estimated 𝐡p\mathbf{h}_{p}, which already provides a low-dimensional representation of the DL CSI, to the BS. In this sense, the UE-side CSI compression mapping 𝒬p\mathcal{Q}_{p} reduces to an identity mapping over the inferred low-dimensional channel domain:

𝐳p=𝒬p(𝐡p)=𝜶p=𝐡p.\mathbf{z}_{p}=\mathcal{Q}_{p}(\mathbf{h}_{p})=\bm{\alpha}_{p}=\mathbf{h}_{p}. (50)

Accordingly, after receiving 𝐳p\mathbf{z}_{p}, the BS reconstructs the channel representation according to

𝐡^p=p(𝐳p;𝐂p)=𝐂p𝐳p=𝐂p𝐡p=𝐂p𝐂pH𝐡.\hat{\mathbf{h}}_{p}=\mathcal{F}_{p}(\mathbf{z}_{p};\mathbf{C}_{p})=\mathbf{C}_{p}\mathbf{z}_{p}=\mathbf{C}_{p}\mathbf{h}_{p}=\mathbf{C}_{p}\mathbf{C}_{p}^{H}\mathbf{h}. (51)

In this process, the CSI representation subspace coincides with the CSI-RS precoder, i.e., 𝐔p=𝐂p\mathbf{U}_{p}=\mathbf{C}_{p}. The corresponding orthogonal projector is 𝐏p=𝐂p𝐂pH\mathbf{P}_{p}=\mathbf{C}_{p}\mathbf{C}_{p}^{H}. The beamforming vector is then obtained by applying MRT to the reconstructed channel as

𝐰^p=p(𝒬p(p(𝐲CSI(𝐂p))))p(𝒬p(p(𝐲CSI(𝐂p))))=𝐏p𝐡𝐏p𝐡.\hat{\mathbf{w}}_{p}=\frac{\mathcal{F}_{p}\big(\mathcal{Q}_{p}(\mathcal{R}_{p}(\mathbf{y}_{\text{CSI}}(\mathbf{C}_{p})))\big)}{\|\mathcal{F}_{p}\big(\mathcal{Q}_{p}(\mathcal{R}_{p}(\mathbf{y}_{\text{CSI}}(\mathbf{C}_{p})))\big)\|}=\frac{\mathbf{P}_{p}\mathbf{h}}{\|\mathbf{P}_{p}\mathbf{h}\|}. (52)

In this way, the proposed scheme remains compatible with the existing feedback pipeline while shifting both the high-dimensional dominant-subspace discovery task and the following coefficient retrieval burden from the UE to the BS through site-specific subspace inference.

V-C CSI-Capture Efficiency Analysis

The achievable rate of the proposed design is given by

Rp=log2(1+ρ|𝐡H𝐰^p|2)=log2(1+ρ𝐏p𝐡2),R_{p}=\log_{2}\left(1+\rho\left|\mathbf{h}^{H}\hat{\mathbf{w}}_{p}\right|^{2}\right)=\log_{2}\left(1+\rho\|\mathbf{P}_{p}\mathbf{h}\|^{2}\right), (53)

Suppose that there exists an oracle QQ-dimensional subspace 𝒰Q\mathcal{U}_{Q}^{\star} that captures the most channel energy among all QQ-dimensional subspaces, i.e.,

𝒰Q=argmax𝒰:dim(𝒰)=Q𝐏𝒰𝐡2.\mathcal{U}_{Q}^{\star}=\arg\max_{\mathcal{U}:\dim(\mathcal{U})=Q}\|\mathbf{P}_{\mathcal{U}}\mathbf{h}\|^{2}. (54)

The corresponding orthogonal projector is denoted by 𝐏Q\mathbf{P}_{Q}^{\star}. Here, 𝒰Q\mathcal{U}_{Q}^{\star} is used only as an analytical benchmark, since such a subspace is practically inaccessible at the BS. For example, when Q=1Q=1, 𝒰Q=span(𝐡)\mathcal{U}_{Q}^{\star}=\mathrm{span}(\mathbf{h}) is an oracle subspace, but the BS cannot infer 𝐡\mathbf{h} from the RSRP vector. Alternatively, this oracle subspace may be interpreted as the subspace that best aligns with the dominant propagation directions. In particular, the oracle subspace can be spanned by the array response vectors of the QQ strongest paths, i.e., 𝒰Q=span([𝐚(ul)]lQ)\mathcal{U}_{Q}^{\star}=\mathrm{span}([\mathbf{a}(u_{l})]_{l\in\mathcal{L}_{Q}^{\star}}), where Q\mathcal{L}_{Q}^{\star} denotes the set of indices of the QQ strongest paths. The captured channel energy is therefore given by

𝐏Q𝐡2=𝜶H𝐀H𝐏Q𝐀𝜶lQ|αl|2gl,Q,\|\mathbf{P}_{Q}^{\star}\mathbf{h}\|^{2}=\bm{\alpha}^{H}\mathbf{A}^{H}\mathbf{P}_{Q}^{\star}\mathbf{A}\bm{\alpha}\approx\sum_{l\in\mathcal{L}_{Q}^{\star}}|\alpha_{l}|^{2}g_{l,Q^{\star}}, (55)

where

gl,Q𝐏Q𝐚(ul)21,lQg_{l,Q^{\star}}\triangleq\|\mathbf{P}_{Q}^{\star}\mathbf{a}(u_{l})\|^{2}\approx 1,\forall l\in\mathcal{L}_{Q}^{\star} (56)

denotes the corresponding oracle path-capture factor. In this case, the top-QQ paths are fully captured by the oracle subspace without the grid mismatch induced by the DFT basis in Type-I/II. Note that this oracle subspace is determined by the specific application and is typically unavailable to the BS. It is used only for theoretical analysis.

Theorem 2 (Subspace-mismatch bound for site-specific Type-II feedback).

Let 𝐏p\mathbf{P}_{p} be the projector onto the subspace inferred by the proposed SSI-enhanced feedback scheme, and let 𝐏Q\mathbf{P}_{Q}^{\star} denote the projector onto an oracle QQ-dimensional benchmark subspace. Define δp𝐏p𝐏Q2\delta_{p}\triangleq\|\mathbf{P}_{p}-\mathbf{P}_{Q}^{\star}\|_{2} and [x]+max{x,0}[x]_{+}\triangleq\max\{x,0\}. Then, the proposed scheme satisfies

𝐏p𝐡2[𝐏Q𝐡2δp𝐡2]+,\|\mathbf{P}_{p}\mathbf{h}\|^{2}\geq\left[\|\mathbf{P}_{Q}^{\star}\mathbf{h}\|^{2}-\delta_{p}\|\mathbf{h}\|^{2}\right]_{+}, (57)

and its achievable rate is lower bounded by

Rplog2(1+ρ[𝐏Q𝐡2δp𝐡2]+).R_{p}\geq\log_{2}\left(1+\rho\left[\|\mathbf{P}_{Q}^{\star}\mathbf{h}\|^{2}-\delta_{p}\|\mathbf{h}\|^{2}\right]_{+}\right). (58)

Moreover, the CSI-capture efficiency satisfies

ηp=𝐏p𝐡2𝐡2[𝐏Q𝐡2𝐡2δp]+.\eta_{p}=\frac{\|\mathbf{P}_{p}\mathbf{h}\|^{2}}{\|\mathbf{h}\|^{2}}\geq\left[\frac{\|\mathbf{P}_{Q}^{\star}\mathbf{h}\|^{2}}{\|\mathbf{h}\|^{2}}-\delta_{p}\right]_{+}. (59)
Proof:

Since both 𝐏p\mathbf{P}_{p} and 𝐏Q\mathbf{P}_{Q}^{\star} are Hermitian projectors,

𝐏p𝐡2=𝐏Q𝐡2+𝐡H(𝐏p𝐏Q)𝐡.\|\mathbf{P}_{p}\mathbf{h}\|^{2}=\|\mathbf{P}_{Q}^{\star}\mathbf{h}\|^{2}+\mathbf{h}^{H}(\mathbf{P}_{p}-\mathbf{P}_{Q}^{\star})\mathbf{h}. (60)

The Rayleigh quotient bound gives 𝐡H(𝐏p𝐏Q)𝐡𝐏p𝐏Q2𝐡2\mathbf{h}^{H}(\mathbf{P}_{p}-\mathbf{P}_{Q}^{\star})\mathbf{h}\geq-\|\mathbf{P}_{p}-\mathbf{P}_{Q}^{\star}\|_{2}\|\mathbf{h}\|^{2}. Combining this inequality with the non-negativity of 𝐏p𝐡2\|\mathbf{P}_{p}\mathbf{h}\|^{2} yields the energy bound. Substituting it into Rp=log2(1+ρ𝐏p𝐡2)R_{p}=\log_{2}(1+\rho\|\mathbf{P}_{p}\mathbf{h}\|^{2}) and normalizing by 𝐡2\|\mathbf{h}\|^{2} complete the proof. ∎

Theorem 2 separates the performance of the proposed feedback scheme into two interpretable terms. The first term, 𝐏Q𝐡2\|\mathbf{P}_{Q}^{\star}\mathbf{h}\|^{2}, represents the best channel energy that can be preserved by an ideal QQ-dimensional benchmark subspace. It is therefore a dimension-limited benchmark: it is upper bounded by 𝐡2\|\mathbf{h}\|^{2} and becomes equal to the full-channel energy only when the chosen QQ-dimensional subspace contains the channel direction. The second term, δp𝐡2\delta_{p}\|\mathbf{h}\|^{2}, quantifies the mismatch between the SSI-inferred subspace and the benchmark oracle subspace. Hence, the theorem shows that the proposed design is limited by both the intrinsic dimension constraint of the reduced CSI-RS subspace and the accuracy with which SSI-conditioned inference identifies that subspace. The overall illustration of the proposed limited-feedback scheme is shown in Fig. 1.

Refer to caption
Figure 1: Illustration of the proposed site-specific Type-II feedback

VI Joint Design of SSB Probing and Subspace Inference

The preceding section describes the proposed site-specific Type-II feedback scheme at the system and signal-processing level, where the site-specific subspace inference is abstracted as a mapping from the SSB probing measurement 𝐫𝐁\mathbf{r}_{\mathbf{B}} to a low-dimensional CSI subspace. The preceding analysis further demonstrates that the effectiveness of the proposed scheme critically depends on the quality of the inferred subspace 𝐂p\mathbf{C}_{p}, which is determined by the inference mapping Ψ()\Psi(\cdot).

VI-A Problem Formulation

The subspace-mismatch bound above motivates a task-oriented design criterion: the inferred subspace should preserve as large a fraction of the channel energy as possible for the channel distribution of the target site under a QQ-dimensional subspace bottleneck. However, directly minimizing the oracle mismatch δp\delta_{p} is not practical because the oracle projector is unavailable and can be non-unique. We therefore formulate the design in terms of CSI-capture efficiency, which directly measures the relative energy preserved by the inferred subspace.
The probing codebook 𝐁\mathbf{B} is coupled with this objective because the inferred subspace is computed only from the RSRP fingerprint 𝐫𝐁\mathbf{r}_{\mathbf{B}} generated during SSB probing. A generic oversampled DFT probing codebook provides uniform directional sounding, but it does not necessarily produce measurements that are most informative for dominant-subspace inference as analyzed and illustrated in [sim]. Conversely, the information-maximizing probing design in [sim] is not tailored to the subspace-capture objective considered here. Therefore, the probing stage and the BS-side inference mapping should be optimized as a single task-oriented module: the probing codebook should shape the RSRP fingerprint so that the inference mapping can recover a low-dimensional subspace with high CSI-capture efficiency.
Accordingly, for a given channel realization, the joint design of the probing codebook 𝐁\mathbf{B} and the inference mapping Ψ()\Psi(\cdot) can be formulated as the following optimization problem

max𝐁,Ψηp(𝐡;𝐁,Ψ)=Ψ(𝐫𝐁)ΨH(𝐫𝐁)𝐡2/𝐡2,\max_{\mathbf{B},\,\Psi}\;\eta_{p}(\mathbf{h};\mathbf{B},\Psi)=\left\|\Psi(\mathbf{r}_{\mathbf{B}})\Psi^{H}(\mathbf{r}_{\mathbf{B}})\mathbf{h}\right\|^{2}/\|\mathbf{h}\|^{2}, (61)

where the inferred basis is assumed to have been orthogonalized before constructing the projector. However, seeking a probing codebook 𝐁\mathbf{B} and an inference mapping Ψ()\Psi(\cdot) such that the inferred subspace preserves the channel energy for every possible channel realization is overly stringent for the present low-dimensional probing design. In particular, when Q<NtQ<N_{t}, satisfying the above condition for all 𝐡\mathbf{h} would require the probing measurement 𝐫𝐁\mathbf{r}_{\mathbf{B}} to identify, from low-overhead observations, a QQ-dimensional subspace aligned with arbitrary directions in Nt×1\mathbb{C}^{N_{t}\times 1}, which is in general impossible. Therefore, rather than pursuing universal optimality over the entire channel space, it is more practical and sufficient for the present problem to optimize the probing codebook and the inference mapping over the site-specific channel set induced by the target propagation environment:

max𝐁,Ψ𝔼𝐡ps[ηp(𝐡;𝐁,Ψ)],\max_{\mathbf{B},\,\Psi}\;\mathbb{E}_{\mathbf{h}\sim p_{s}}\!\left[\eta_{p}(\mathbf{h};\mathbf{B},\Psi)\right], (62)

where the expectation is taken over the site-specific channel distribution psp_{s}.

VI-B Learning-based End-to-End Solver

The above joint design problem remains difficult to solve by conventional optimization techniques. First, the site-specific channel distribution psp_{s} is generally unavailable in closed form and can only be accessed through channel samples collected from the target environment. Second, the probing codebook 𝐁\mathbf{B} and the inference mapping Ψ()\Psi(\cdot) are strongly coupled through the nonlinear RSRP measurement process and the resulting subspace projector. Third, practical constraints on the probing codebook, such as beam normalization or phase-only implementation, further render the problem highly nonconvex. Therefore, instead of seeking a closed-form solution, we adopt a task-driven end-to-end learning framework that directly optimizes the probing codebook and the inference mapping from site-specific channel data.
Specifically, the probing codebook 𝐁\mathbf{B} serves as an encoder fef_{e}, which maps the input channel to a low-dimensional dB-domain RSRP fingerprint,

𝐫𝐁0=fe(𝐡;𝐁)=10log10(PSSB|𝐁H𝐡|2),\mathbf{r}_{\mathbf{B}}^{0}=f_{e}(\mathbf{h};\mathbf{B})=10\log_{10}\!\left(P_{\mathrm{SSB}}\left|\mathbf{B}^{H}\mathbf{h}\right|^{2}\right), (63)

where the magnitude square and logarithm are applied elementwise. The intermediate layer then models the measurement uncertainty introduced by shadowing and thermal noise according to (8), yielding the observed RSRP vector 𝐫𝐁\mathbf{r}_{\mathbf{B}}. The subspace inference mapping is parameterized by a neural network Ψθ()\Psi_{\theta}(\cdot) with trainable parameters 𝜽\bm{\theta} and serves as the decoder fdf_{d}, which takes the noisy RSRP measurement as input and produces the inferred subspace representation,

𝐂p=fd(𝐫𝐁;𝜽)=Ψθ(𝐫𝐁).\mathbf{C}_{p}=f_{d}(\mathbf{r}_{\mathbf{B}};\bm{\theta})=\Psi_{\theta}(\mathbf{r}_{\mathbf{B}}). (64)

Let s={𝐡(n)}n=1Nh\mathcal{H}_{s}=\{\mathbf{h}^{(n)}\}_{n=1}^{N_{h}} denote a set of channel samples collected from the target site. If {𝐡(n)}n=1Nh\{\mathbf{h}^{(n)}\}_{n=1}^{N_{h}} are i.i.d. samples drawn from the site-specific channel distribution psp_{s}, then by the law of large numbers the empirical measure induced by s\mathcal{H}_{s} converges to psp_{s} as NhN_{h}\to\infty. In this case, the statistical objective in (62) is approximated by the empirical average

min𝐁,𝜽s=1Nhn=1Nhηp(𝐡(n);𝐁,Ψθ),\min_{\mathbf{B},\bm{\theta}}\;\mathcal{L}_{\mathcal{H}_{s}}=-\frac{1}{N_{h}}\sum_{n=1}^{N_{h}}\eta_{p}\!\left(\mathbf{h}^{(n)};\mathbf{B},\Psi_{\theta}\right), (65)

where ηp(𝐡(n);𝐁,Ψθ)\eta_{p}\!\left(\mathbf{h}^{(n)};\mathbf{B},\Psi_{\theta}\right) is evaluated using the probing measurement 𝐫𝐁(n)\mathbf{r}_{\mathbf{B}}^{(n)} generated from 𝐡(n)\mathbf{h}^{(n)} under the current probing codebook 𝐁\mathbf{B}.
A useful perspective is to view the proposed scheme as the first stage of a hierarchical CSI autoencoding process. Instead of directly reconstructing the instantaneous channel from a learned latent representation, the probing encoder and the subspace decoder first encode and recover a structural representation of the channel, namely, its dominant low-dimensional subspace. In this sense, the site-specific probing stage acts as a coarse channel autoencoder that extracts the slowly-varying spatial structure from the full-dimensional channel and represents it by a compact RSRP fingerprint. The subsequent CSI feedback stage then performs a second-stage low-dimensional refinement by feeding back the instantaneous coefficients within the inferred subspace. Therefore, the proposed framework decomposes CSI acquisition into structural subspace encoding and residual coefficient encoding, which is precisely what enables low-overhead yet effective feedback.

VI-C Practical Realization

Since the probing encoder 𝐁\mathbf{B} is inherently trainable, it can be directly optimized in the learning process, subject only to the unit-power constraint 𝐛k2=1\|\mathbf{b}_{k}\|^{2}=1 for each probing beam. The subspace decoder Ψθ\Psi_{\theta} can be implemented by a multi-layer perceptron (MLP), which has a depth-DD fully connected architecture with hidden width WW, layer normalization (LN) [ba2016layer], and Gaussian error linear unit (GELU) activation [Hendrycks2016GELU]. Let 𝐟(0)=𝐫¯𝐁(n)\mathbf{f}^{(0)}=\bar{\mathbf{r}}_{\mathbf{B}}^{(n)}. Then, for d=1,,Dd=1,\ldots,D, the hidden features are updated as

𝐟(d)=GELU(LN(𝐖d𝐟(d1)+𝐛d))W×1,\mathbf{f}^{(d)}=\mathrm{GELU}\!\left(\mathrm{LN}\!\left(\mathbf{W}_{d}\mathbf{f}^{(d-1)}+\mathbf{b}_{d}\right)\right)\in\mathbb{R}^{W\times 1}, (66)

where GELU(x)=xΦ(x)\mathrm{GELU}(x)=x\Phi(x) and Φ(x)\Phi(x) is the cumulative distribution function of the standard Gaussian distribution. The output layer yields

𝐟(out)=𝐖out𝐟(D)+𝐛out2NtQ×1.\mathbf{f}^{(\mathrm{out})}=\mathbf{W}_{\rm out}\mathbf{f}^{(D)}+\mathbf{b}_{\rm out}\in\mathbb{R}^{2N_{t}Q\times 1}. (67)

The output vector is then reshaped into the real and imaginary parts of a raw complex QQ-dimensional subspace representation 𝐂~p\widetilde{\mathbf{C}}_{p}. To remove the scale ambiguity of the predicted basis, each column of 𝐂~p\widetilde{\mathbf{C}}_{p} is normalized to unit norm and orthogonalized before constructing the corresponding subspace projector.
With the differentiable RSRP measurement model adopted above, the sample-average objective in (65) can be optimized with respect to both 𝐁\mathbf{B} and 𝜽\bm{\theta} through backpropagation. However, accurately capturing the site distribution typically requires a large NhN_{h}, which makes full-batch optimization computationally burdensome. Mini-batch stochastic gradient descent (SGD) [GhadimiLan2013] offers a scalable alternative by using a randomly sampled subset of UEs to form a stochastic approximation of the objective and its gradient. For a mini-batch s\mathcal{B}\subset\mathcal{H}_{s}, the empirical training loss is defined as

(𝐁,𝜽)=1||𝐡(n)ηp(𝐡(n);𝐁,Ψθ).\mathcal{L}_{\mathcal{B}}(\mathbf{B},\bm{\theta})=-\frac{1}{|\mathcal{B}|}\sum_{\mathbf{h}^{(n)}\in\mathcal{B}}\eta_{p}\!\left(\mathbf{h}^{(n)};\mathbf{B},\Psi_{\theta}\right). (68)

The overall implementation then follows a two-stage procedure, namely offline training and online deployment. During offline training, the probing codebook 𝐁\mathbf{B} and the inference network Ψθ()\Psi_{\theta}(\cdot) are optimized jointly over site-specific channel samples, which is summarized in Algorithm 1. During online deployment, the learned probing codebook 𝐁\mathbf{B} and subspace inference mapping Ψθ()\Psi_{\theta}(\cdot) are embedded into the framework introduced in Section V, whose detailed procedure is summarized in Algorithm 2.

Algorithm 1 Offline Joint Training of the Probing Codebook and the Subspace Inference Mapping
0:  Site-specific CSI dataset s\mathcal{H}_{s}, batch size BB, step β\beta, iterations II
0:  Optimized probing codebook and subspace inference mapping 𝐁\mathbf{B}^{\star} and Ψθ\Psi_{\theta^{\star}}
1:  Initialize probing codebook 𝐁(0)\mathbf{B}^{(0)} and network parameters 𝜽(0)\bm{\theta}^{(0)}.
2:  for i=1,2,,Ii=1,2,\cdots,I do
3:   Sample a mini-batch of data is\mathcal{B}_{i}\subset\mathcal{H}_{s} with |i|=B|\mathcal{B}_{i}|=B.
4:   Generate probing RSRP measurements by (63).
5:   Contaminate the RSRP measurements by (8).
6:   Infer the subspace by (64).
7:   Compute the loss i(𝐁(i1),𝜽(i1))\mathcal{L}_{\mathcal{B}_{i}}(\mathbf{B}^{(i-1)},\bm{\theta}^{(i-1)}) by (68).
8:   Update 𝐁\mathbf{B} and 𝜽\bm{\theta} by 𝐁(i)𝐁(i1)β𝐁i\mathbf{B}^{(i)}\leftarrow\mathbf{B}^{(i-1)}-\beta\nabla_{\mathbf{B}}\mathcal{L}_{\mathcal{B}_{i}} and 𝜽(i)𝜽(i1)β𝜽i\bm{\theta}^{(i)}\leftarrow\bm{\theta}^{(i-1)}-\beta\nabla_{\bm{\theta}}\mathcal{L}_{\mathcal{B}_{i}}.
9:  end for
10:  The trained probing codebook and inference module are obtained as 𝐁=𝐁(I)\mathbf{B}^{\star}=\mathbf{B}^{(I)} and Ψθ=Ψθ(I)\Psi_{\theta^{\star}}=\Psi_{\theta}^{(I)}.
Algorithm 2 Online Deployment of the Proposed SSI-Enhanced Type-II Feedback
0:  Trained probing codebook and inference mapping 𝐁\mathbf{B}^{\star} and Ψθ\Psi_{\theta^{\star}}
0:  Beamforming vector 𝐰^p\hat{\mathbf{w}}_{p}
1:  BS sweeps the learned probing codebook 𝐁\mathbf{B}^{\star} with SSB.
2:  UE feeds back the measured RSRP vector 𝐫𝐁\mathbf{r}_{\mathbf{B}}.
3:  BS infers the dominant subspace 𝐂p\mathbf{C}_{p} by (64).
4:  BS transmits low-dimensional beamformed CSI-RS over the inferred subspace by (48).
5:  UE feeds back the estimated effective channel by (50).
6:  BS reconstructs the DL CSI by (51) and applies the beamforming vector by (52).
TABLE I: Summary of Conventional and Proposed Limited-Feedback Schemes
 
 
Scheme Subspace 𝒰\mathcal{U} Overhead ToT_{o} UE Complexity Efficiency η\eta
 
Type-I span(𝐝𝐳I)\mathrm{span}(\mathbf{d}_{\mathbf{z}_{\rm I}}) Nt+1N_{t}+1 𝒪(Nt2OD)\mathcal{O}(N_{t}^{2}O_{D}) ηI|αlI|2glIl=1L|αl|2\eta_{\rm I}\!\approx\!\frac{|\alpha_{l_{\rm I}}|^{2}g_{l_{\rm I}}}{\sum_{l=1}^{L}|\alpha_{l}|^{2}}
Type-II span(𝐃𝒮II)\mathrm{span}(\mathbf{D}_{\mathcal{S}_{\rm II}}) Nt+2QN_{t}+2Q 𝒪(Nt2QOD+Q3)\mathcal{O}(N_{t}^{2}QO_{D}+Q^{3}) ηIIlQ,II|αl|2gl,IIl=1L|αl|2\eta_{\rm II}\!\approx\!\frac{\sum_{l\in\mathcal{L}_{Q,\rm II}}|\alpha_{l}|^{2}g_{l,\rm II}}{\sum_{l=1}^{L}|\alpha_{l}|^{2}}
PSC span(𝐄𝒮PSC)\mathrm{span}(\mathbf{E}_{\mathcal{S}_{\rm PSC}}) Nt+2NpN_{t}+2N_{p} 𝒪(Nt2)\mathcal{O}(N_{t}^{2}) ηPSC=𝐏PSC𝐡2𝐡2=i=1Np|h~|(i)2𝐡2\eta_{\rm PSC}\!=\!\frac{\|\mathbf{P}_{\rm PSC}\mathbf{h}\|^{2}}{\|\mathbf{h}\|^{2}}=\frac{\sum_{i=1}^{N_{p}}|\tilde{h}|_{(i)}^{2}}{\|\mathbf{h}\|^{2}}
Proposed span(𝐂p)\mathrm{span}(\mathbf{C}_{p}) K+2QK+2Q 𝒪(Q2)\mathcal{O}(Q^{2}) ηp=𝐏p𝐡2𝐡2\eta_{\rm p}\!=\!\frac{\|\mathbf{P}_{\rm p}\mathbf{h}\|^{2}}{\|\mathbf{h}\|^{2}}
 
 
\tab@right\tab@restorehlstate

VI-D Convergence and Complexity Analysis

Let 𝝃(𝐁,𝜽)\bm{\xi}\triangleq(\mathbf{B},\bm{\theta}) collect all trainable variables. Suppose each mini-batch i\mathcal{B}_{i} is uniformly sampled from s\mathcal{H}_{s}, and define the stochastic gradient

𝐠i𝝃i(𝝃i).\mathbf{g}_{i}\triangleq\nabla_{\bm{\xi}}\mathcal{L}_{\mathcal{B}_{i}}(\bm{\xi}_{i}). (69)

Assume that s(𝝃)\mathcal{L}_{\mathcal{H}_{s}}(\bm{\xi}) is lower bounded and LfL_{f}-smooth, and that the stochastic gradient is an unbiased estimator of the full gradient with bounded variance that decreases with the mini-batch size, i.e.,

𝔼[𝐠i]=𝝃s(𝝃i),𝔼[𝐠i𝝃s(𝝃i)2]σ2|i|.\mathbb{E}[\mathbf{g}_{i}]=\nabla_{\bm{\xi}}\mathcal{L}_{\mathcal{H}_{s}}(\bm{\xi}_{i}),\ \mathbb{E}\!\left[\|\mathbf{g}_{i}-\nabla_{\bm{\xi}}\mathcal{L}_{\mathcal{H}_{s}}(\bm{\xi}_{i})\|^{2}\right]\leq\frac{\sigma^{2}}{|\mathcal{B}_{i}|}. (70)

Then, the mini-batch SGD update 𝝃i+1=𝝃iγi𝐠i\bm{\xi}_{i+1}=\bm{\xi}_{i}-\gamma_{i}\mathbf{g}_{i} with a standard diminishing or sufficiently small constant stepsize converges in expectation to a first-order stationary point. In particular, for the usual 𝒪(1/I)\mathcal{O}(1/\sqrt{I}) stepsize scaling, the average gradient norm obeys the standard nonconvex SGD rate

1Ii=0I1𝔼[𝝃s(𝝃i)2]=𝒪(1I+σ2||I).\frac{1}{I}\sum_{i=0}^{I-1}\mathbb{E}\!\left[\|\nabla_{\bm{\xi}}\mathcal{L}_{\mathcal{H}_{s}}(\bm{\xi}_{i})\|^{2}\right]=\mathcal{O}\!\left(\frac{1}{\sqrt{I}}+\frac{\sigma^{2}}{|\mathcal{B}|\sqrt{I}}\right). (71)

This result follows from the standard descent analysis for smooth nonconvex stochastic optimization and shows that the proposed mini-batch joint solver approaches a stationary solution in expectation, with the gradient-variance term reduced by the mini-batch size.
We focus on the online deployment complexity, since the learning-based design is trained offline and its cost is amortized over the deployment site. During online deployment, the main complexity of the proposed scheme comes from BS-side subspace inference and UE-side effective channel estimation. For a depth-DD MLP with hidden width WW, the BS-side inference complexity is 𝒪(KW+(D1)W2+2NtQW)\mathcal{O}\!\left(KW+(D-1)W^{2}+2N_{t}QW\right), while the UE-side LS estimation complexity is on the order of 𝒪(Q2)\mathcal{O}(Q^{2}). By contrast, conventional feedback schemes place the dominant CSI-processing burden at the UE. Specifically, the UE-side complexity scales as 𝒪(Nt2OD)\mathcal{O}(N_{t}^{2}O_{D}) for Type-I, 𝒪(Nt2QOD+Q3)\mathcal{O}(N_{t}^{2}QO_{D}+Q^{3}) for Type-II with greedy beam selection, and 𝒪(Nt2+NtlogNt)\mathcal{O}(N_{t}^{2}+N_{t}\log N_{t}) for PSC, whereas the corresponding BS-side reconstruction complexity is only 𝒪(Nt)\mathcal{O}(N_{t}), 𝒪(NtQ)\mathcal{O}(N_{t}Q), and 𝒪(NtNp)\mathcal{O}(N_{t}N_{p}), respectively.
Table I summarizes the conventional Type-I, Type-II, PSC, and proposed schemes under the unified subspace-projection viewpoint. Since SSB sweeping is shared by all schemes, it is excluded from the overhead comparison, and only the additional SSB-stage reporting required by the proposed scheme is counted. As shown in Table I, the proposed site-specific Type-II retains Type-II-like subspace representation capability with only low-dimensional online refinement, thereby achieving a more favorable overhead-complexity-performance tradeoff.

VII Numerical Results

In this section, numerical results are provided to verify the effectiveness of the proposed end-to-end joint probing-and-inference design and the proposed site-specific Type-II feedback framework based on the DeepMIMO dataset [Alkhateeb2019DeepMIMO]. Key simulation parameters are summarized in Table II, unless otherwise stated. By default, we use the “asu_campus_3p5” scenario, a 3.5 GHz outdoor deployment with rich multipath, moderate angular spread, and abundant UE samples. We also use the “O1_28”, “O1B_28”, and “boston5g_3p5” scenarios for additional validation, which correspond to a 28 GHz simple line-of-sight (LoS) outdoor scenario, a 28 GHz blocked non-LoS (NLoS) outdoor scenario, and a 3.5 GHz complex city scenario, respectively.

TABLE II: Simulation settings
 
Parameter Description Value
 
fcf_{c} Carrier frequency 3.5 GHz
BW Bandwidth 10 MHz
PtP_{t} Transmit power 40 dBm
KK Size of SSB codebook 8
QQ Dimension of the inferred subspace 4
NtN_{t} Number of BS antennas 64
LsL_{s} Number of SSB symbols 20
TcT_{c} Coherence block length 1000
dd Antenna spacing λ/2\lambda/2
SnS_{n} Noise power spectrum density -170 dBm/Hz
σsh2\sigma_{\text{sh}}^{2} Log-variance of shadowing 1 dB
DD MLP depth 3
WW Hidden width 256
 
\tab@right\tab@restorehlstate
TABLE III: CSI-capture efficiency comparison of different probing schemes across representative DeepMIMO scenarios.
 
 
Scenario Proposed Random DFT
 
O1_28 0.99 0.90 0.91
O1B_28 0.99 0.93 0.90
asu_campus_3p5 0.89 0.62 0.73
boston5g_3p5 0.97 0.81 0.86
 
Average 0.96 0.82 0.85
 
 
\tab@right\tab@restorehlstate

VII-A Evaluation of the End-to-End Joint Design

VII-A1 Convergence Demonstration

We first demonstrate the convergence of the proposed mini-batch SGD solver for the joint design problem. Fig. 2 illustrates the convergence behavior of the proposed end-to-end solver under three representative (K,Q)(K,Q) settings. The curves show stable convergence for all configurations, and most of the performance gain is achieved within roughly the first 100100 epochs, after which the curves enter a clear saturation region. Moreover, the training and validation curves remain close to each other for all three settings, indicating that the learned probing-and-inference model generalizes well and does not suffer from severe overfitting. In addition, the converged capture efficiency increases consistently as the probing dimension and the inferred subspace dimension become larger. In particular, the (K,Q)=(16,8)(K,Q)=(16,8) configuration achieves the highest efficiency, followed by (16,4)(16,4) and (8,4)(8,4), which confirms that both richer probing measurements and a higher-dimensional inferred subspace are beneficial to the proposed framework.

Refer to caption
Figure 2: Convergence of the proposed end-to-end design
Refer to caption
Figure 3: Pareto-optimal overhead-performance illustration

VII-A2 Ablation Study of the End-to-End Design

To validate whether the gain of the proposed end-to-end solver indeed comes from the jointly learned probing encoder, we compare the CSI-capture efficiency of the proposed end-to-end design with two baseline probing modes under the same MLP decoder and training pipeline: i) a fixed random probing codebook, and ii) a fixed DFT probing codebook. Table III reports the comparison across four representative DeepMIMO scenarios. The table shows that the proposed scheme consistently achieves the highest capture efficiency in all scenarios. Concretely, in the relatively easier “O1_28” and “O1B_28” scenarios, all methods already achieve high capture efficiencies, while the proposed design still preserves a clear advantage. In more challenging environments, especially “asu_campus_3p5”, the gain becomes substantially larger, indicating that the learned probing codebook is more effective at generating informative fingerprints for downstream subspace inference when the propagation structure is more complex. Moreover, the relative ordering between the random and DFT baselines changes across scenarios, whereas the proposed design remains consistently superior, which further demonstrates its robustness and site-adaptive nature.

VII-A3 Overhead-Performance Tradeoff

The previous results already indicate that the probing dimension KK and the inferred subspace dimension QQ are the two critical design parameters of the proposed framework, as they determine the informativeness of the RSRP fingerprint and the expressiveness of the inferred CSI subspace, respectively. Fig. 3 further characterizes the resulting overhead-performance tradeoff in the “asu_campus_3p5” scenario, where the horizontal axis denotes the total online overhead and the vertical axis denotes the CSI-capture efficiency. The Pareto-optimal operating points are highlighted by the red circles and the connecting curve. The capture efficiency improves rapidly in the low-overhead regime, but soon enters a saturation region as the overhead increases, revealing a pronounced diminishing-return effect. More importantly, the Pareto frontier shows that high performance is achieved only when KK and QQ are jointly balanced: a small KK limits the discriminability of the RSRP fingerprint, while a small QQ limits the representational capability of the inferred subspace. Hence, increasing only one of the two dimensions is generally inefficient once the other becomes the bottleneck. This explains why several moderate-overhead configurations already approach the maximum capture efficiency, whereas further enlarging both KK and QQ brings only marginal gain. In addition, the effective spectral efficiency, indicated by the color map, is not maximized at the largest-overhead points, since the incremental subspace-capture gain is eventually outweighed by the increased overhead penalty.

VII-B Evaluation of the Proposed Feedback Scheme

TABLE IV: CSI-capture efficiency comparison of different feedback schemes in two representative scenarios.
 
 
Scheme Boston O1_28
 
Type-I 0.7017 0.7405
Type-II 0.9951 0.9992
PSC 0.2606 0.2510
Proposed 0.986 0.9999
 
 
\tab@right\tab@restorehlstate
Refer to caption
Figure 4: Effective spectral efficiency versus SNR

VII-B1 Scheme Comparison

Table IV and Fig. 4 compare the proposed scheme configured with (K,Q)=(16,8)(K,Q)=(16,8) against the Type-I, Type-II, and PSC baselines in the “boston5g_3p5” and “O1_28” scenarios under the same simulation settings. As expected, the proposed scheme attains a capture efficiency that is very close to, and in some cases even slightly higher than, that of Type-II. More importantly, despite this comparable raw efficiency, the proposed scheme consistently achieves the highest effective spectral efficiency over the entire SNR range. This result directly reflects the main advantage of the proposed SSI-enhanced framework: instead of pursuing marginal gains in instantaneous subspace optimality at the cost of full-dimensional CSI acquisition, it leverages site-specific inference to obtain Type-II-comparable subspace quality with much lower online overhead. Consequently, the proposed scheme converts Type-II-comparable capture efficiency into a strictly better system-level overhead-efficiency tradeoff. In other words, the key gain of the proposed framework comes not from universally outperforming Type-II in raw efficiency, but from matching Type-II in the relevant subspace-capture regime more efficiently.

VII-B2 UE Performance Statistics

Fig. 5 shows the cumulative distribution function (CDF) of the effective spectral efficiency of four feedback schemes at an SNR of 10 dB in the “asu_campus_3p5” and “O1B_28” scenarios. The proposed scheme exhibits the rightmost CDF in both scenarios, indicating that it delivers higher effective spectral efficiency for the majority of users rather than only improving the average performance. This result is consistent with the previous average-rate comparison and further confirms that the gain of the proposed framework is population-wide. In particular, although the proposed scheme and Type-II often achieve comparable raw CSI-capture efficiency, the lower online overhead of the proposed framework shifts its effective-spectral-efficiency distribution consistently to the right of Type-II. Therefore, the benefit of the proposed method does not come from universally dominating Type-II in instantaneous subspace quality, but from realizing Type-II-comparable subspace quality with much lower overhead and converting this advantage into better UE-level effective spectral efficiency. By contrast, Type-I remains limited by its rank-one representation, which leads to a visibly broader and left-shifted distribution, while PSC performs the worst in both scenarios due to its insufficient ability to capture the dominant channel energy in the considered port-domain setting.

Refer to caption
Figure 5: CDF of effective spectral efficiency

VII-B3 Physical Interpretation of the Inferred Subspace

Beyond rate and capture metrics, we further visualize the physical meaning of the learned subspace. Specifically, we select three representative test users in the “asu_campus_3p5” scenario and compare the angular responses of the proposed, Type-I, and Type-II subspace projectors. For each scheme s{p,I,II}s\in\{p,\mathrm{I},\mathrm{II}\}, let 𝐏s\mathbf{P}_{s} denote the corresponding subspace projector. Its angular response is evaluated over the azimuth angle φ\varphi as

Gs(φ)=𝐚H(φ)𝐏s𝐚(φ).G_{s}(\varphi)=\mathbf{a}^{H}(\varphi)\mathbf{P}_{s}\mathbf{a}(\varphi). (72)

For each user, the three responses and path powers are all normalized by their maximum value for visualization.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Figure 6: Physical interpretation of the inferred subspace

Fig. 6 visualizes these angular responses. First, the response of the proposed scheme is consistently concentrated around the dominant physical path clusters, which confirms that the inferred subspace is not an arbitrary latent representation but has a clear geometric interpretation in the angular domain. Second, the proposed response closely resembles that of Type-II in the dominant angular regions, which explains why the proposed scheme achieves CSI-capture efficiency that is close to Type-II in the quantitative results. Third, the Type-I response is much narrower and typically covers only one local direction, which explains its substantially lower capture efficiency. Overall, these figures provide physical evidence that the proposed SSI-enhanced framework succeeds in inferring a low-dimensional subspace aligned with the dominant propagation geometry, thereby achieving Type-II-comparable subspace quality with significantly reduced overhead.

VIII Conclusion

This paper developed a unified subspace-projection framework for limited-feedback beamforming and proposed a site-specific Type-II feedback scheme. By using offline learned SSI together with low-overhead RSRP fingerprints, the BS infers a UE-dependent dominant subspace in advance, so that the UE only needs to estimate and feed back low-dimensional effective CSI coefficients within that subspace. Simulation results showed that the proposed scheme achieves a more favorable overhead-performance tradeoff than conventional feedback mechanisms, retaining Type-II-comparable subspace quality with substantially lower online CSI acquisition overhead and UE-side complexity. Future work may extend the framework to multi-user and multi-stream settings.

References