Skip to Main Content
Purpose

This study aims to investigate the performance of new theoretical non-asymptotic bounds of the least squares estimator for linear time-invariant (LTI) state-space models, building on prior foundational research.

Design/methodology/approach

Using both simulated and real-world datasets, the analysis examines the influence of noise characteristics, controllability and noise conditioning on estimation accuracy. Furthermore, the study validates the derived bounds on real datasets through, among others, cross-validation and bootstrap techniques.

Findings

The results confirm the validity of the theoretical non-asymptotic error bounds across various system configurations. Key insights highlight the role of system stability, dimensionality and noise properties in shaping estimation performance.

Research limitations/implications

While this study focuses on specific configurations and datasets, expanding the scope to include alternative estimation techniques and broader real-world systems could yield additional insights into the practical applicability of these bounds.

Practical implications

The findings provide actionable insights and practical guidelines for improving econometric modeling, financial time-series forecasting, adaptive systems control and environmental modeling, enabling improved system identification and parameter estimation techniques in these domains.

Originality/value

By using real datasets and advanced validation techniques, this study bridges the gap between theoretical bounds and practical applications, hopefully opening a novel perspective on least squares estimation performance in LTI systems.

The estimation of Linear time-variant (LTI) systems is a fundamental problem in statistics, with applications spanning diverse fields such as system identification, finance, biology, and various engineering and control domains (Ljung, 1986).

The performance of least squares estimators is particularly relevant in economics and finance, where accurate modeling of time-series data and state-space systems is essential for tasks such as financial forecasting, portfolio optimization, and macroeconomic policy modeling. This study bridges theoretical bounds and practical applications in these fields.

Historically, research on this topic has predominantly focused on the asymptotic convergence properties of specific estimation techniques, such as ordinary least squares (Ljung, 1976a), prediction error methods (Ljung, 1976b), and maximum likelihood. However, the finite-time behavior of estimation schemes has received comparatively less attention.

Recently, analyzing the finite-time properties of these estimation methods for LTI systems became a crucial area of interest because of new use cases and challenges in machine learning. For instance, these error rates dictate the performance of model-based reinforcement learning algorithms that rely on a linear quadratic regulator (Arora et al., 2018). These studies primarily investigate the finite-time properties of classical estimation methods, including ordinary least squares, for both observable (Simchowitz et al., 2018) and unobservable states (Oymak and Ozay, 2019; Djehiche and Mazhar, 2022). The findings often leverage advanced tools from statistical learning theory, random matrix theory, and self-normalizing processes.

In addition, some recent works have provided information-theoretic perspectives, offering lower bounds on the sample complexity required to achieve a specified estimation accuracy (Jedra and Proutière, 2019; Djehiche and Mazhar, 2021; Djehiche et al., 2025). These lower bounds quantify the minimal number of observations needed to estimate the system’s parameters.

Among the most comprehensive contributions to this field is (Djehiche et al., 2025), which establishes both upper and lower bounds for the least squares estimator in LTI systems. These bounds are nearly tight, differing only by an unknown positive constant and parameters reflecting the estimation task’s complexity, such as system stability, observability, and noise intensity.

The current work builds on these theoretical foundations and aims to validate the derived error bounds in practical settings using both simulated and real-world data. Specifically, this study has three primary objectives:

  1. Validate theoretical error bounds for least squares estimation across various system configurations.

  2. Investigate the influence of factors such as system stability, noise properties, and system dimensionality on estimation performance.

  3. Demonstrate the practical applicability of the results through experiments on synthetic and real-world datasets.

Unlike simulated data, where the system matrices are generated with controlled properties, real-world datasets present additional challenges such as unknown dynamics, and data variability. Preprocessing steps, including normalization and ensuring compatibility with the LTI modeling framework, were applied to prepare the datasets for analysis. The two datasets are

  • Beijing Multi-Site Air-Quality Dataset: A dataset from the UCI Machine Learning Repository containing hourly air pollutant measurements for evaluating model robustness in environmental monitoring.

  • Dynamical System Multivariate Time Series Forecast Dataset: A synthetic dataset from Kaggle designed to mimic real-world dynamical systems for testing estimation methods. This dataset mirrors the dynamics observed in financial markets, where multivariate time series are used to model interactions between economic indicators or asset prices.

Validation on these datasets involves estimating the system matrices and comparing the observed estimation errors with the theoretical bounds derived from the controllability and noise structure. This real-world analysis complements the simulated experiments, highlighting the bounds’ applicability across diverse scenarios. The chosen datasets and simulation setups are representative of challenges faced in financial econometrics and macroeconomic modeling, where noise, stability, and high-dimensional data are common obstacles.

The structure of the paper is as follows. After this introduction and a brief outline of commonly used notations, we introduce the LTI system estimation problem in Section 2. This is followed by a review of the classical least squares solution, the standard assumptions for deriving error estimates, and the key results from (Djehiche et al., 2025) that provide the primary error bounds, discussed in Section 1.

Section 3 outlines the methodology for validating these theoretical findings. It details the randomized generation process for LTI system parameters used in numerical experiments, the simulation procedure for generating system trajectories, the estimation method for LTI parameters, the validation tests conducted on simulated data, and the datasets employed to assess the practicality and applicability of the theoretical bounds in real-world conditions.

Section 4 presents an analysis of the simulation results, highlighting the degree to which the theoretical upper and lower error bounds predict the least squares estimator’s performance and examining the complexity parameters’ influence on estimation errors. Section 5 demonstrates the practical effectiveness of the least squares estimator in real-world data and evaluates the accuracy of the theoretical bounds using two datasets: one for the task of time series forecasting and another for the environmental task of air quality estimation in Beijing.

Finally, Section 6 discusses the key insights from this empirical study, and concludes the paper.

Throughout this work, we use the following notation:

  • We denote by W≔ diag(w1, , wd) a matrix in Md×d(C) whose only nonzero entries are on the diagonal, with Wii = wi. A real matrix WMd×d(R) admits a singular value decomposition (SVD) W = UΣV, where U, V ∈ O(d) (the group of orthogonal matrices) and Σ = diag(s1(W), , sd(W)) contains the singular values, satisfying 0 ≤ s1(W) ≤ s2(W) ≤ ⋯ ≤ sd(W). We define smax(W) = sd(W), smin(W) = s1(W), and the condition number cond(W)=smax(W)smin(W).

  • A matrix WMd×d(R) is diagonalizable if it can be expressed as W = SΓS−1, where S is an invertible matrix (the change of basis matrix) and Γ = diag(λ1, , λd) is the diagonal matrix of eigenvalues. Examples of diagonalizable matrices include symmetric and positive definite matrices. For a positive definite matrix, the decomposition W = SΓS−1 is referred to as its spectral decomposition, where all eigenvalues λi are positive. We denote the largest eigenvalue of a diagonalizable matrix W (in magnitude) by λmax(W) and the smallest by λmin(W), and recall the inequalities smax(W) ≥|λmax(W)|≥|λmin(W)|≥ smin(W).

  • We denote by (Ω,F) the underlying measurable space. A probability measure defined on it is denoted by P, and E represents the corresponding expectation operator.

  • All norms are indicated with a subscript identifying the corresponding normed space, except for the absolute value |⋅|. The p-norms of a vector xRn are given by

The p-Schatten norm of a matrix W in the Schatten space Sp is defined as

where S2n1={xRn|x|21} denotes the Euclidean unit sphere in Rn. We also have |W|S=smax(W).

The spaces Lp(Ω,F,P)={x:ΩR measurable, E(|x|p)<} of random variables are equipped with the Lp-norms:

where ess sup denotes the essential supremum with respect to the measure P.

  • Throughout this work, C denotes positive constants whose exact values are immaterial to the results. The notation x ≲ y (resp. x ≳ y) means a positive constant C exists such as x ≤ Cy (resp. x ≥ Cy). The minimum (maximum) of two real numbers x and y is denoted as min(x, y) = xy (max(x, y) = xy).

  • The notation x ≃ y indicates that x and y are comparable up to constant factors, meaning there exist positive constants c and C such that cy ≤ x ≤ Cy.

We consider the LTI state-space model:

(1)

where AMd×d(R) is a parameter matrix and i represents Gaussian noise. Here, BMd×r(R) is an unknown, rectangular, full-rank matrix with r ≤ d, and the noise sequence (εi)i is i.i.d. multivariate normal N(0,Ir). For simplicity, we assume x0 = 0. Expanding (1), we derive the following expression for the observed covariates for i ∈ [[1, N]]:

(2)

In the following definition, we introduce the concept of a controllable system.

Definition 2.1.

The system [A, B] with AMd×d(C) and BMd×r(C) is said to be controllable if there exists an integer d¯ such that the columns of the d¯-order controllability matrix

span Rd. We also define the controllability order dˇ as the smallest such d¯.

The least-squares estimator for the parameter matrix A is given by:

(3)

where M* denotes the conjugate transpose of the matrix M, and y22i=1n|yi|2. In Lemma 2.1 of (Djehiche et al., 2025) it is shown that the sample covariance matrix for the model (1) is invertible with probability one if the pair [A, B] is controllable and N ≳ d.

The behavior of the least square estimator for estimating the LTI dynamics governed by xt+1 = Axt + t is strongly influenced by the stability properties of the matrix A. Stability classification depends on the eigenvalues of A, with the following cases:

  • Stable systems: |λi| < 1 for all eigenvalues λi of A.

  • Unstable systems: |λi| > 1 for any eigenvalue λi of A.

  • Limit-stable systems: |λi| = 1 for some eigenvalues λi of A.

Theoretical bounds on the least-squares estimator Aˆ offer valuable insights into its estimation accuracy. These bounds are also influenced by the dimension of the matrix A and the controllability properties of the pair [A, B]. Specifically, for the least-squares estimator Aˆ of the auto-regression matrix A, the following results hold for the estimation loss Lp(Aˆ,A) for p ∈ [1, ] and the mean square estimation risk E2(Aˆ,A) defined as follows:

Below, we summarize the bounds for the estimation loss and mean square risk, depending on the stability of the matrix A, while highlighting the contribution of the complexity parameters of the pair [A, B]. By complexity parameters, we refer to the dimension of the matrix A, the controllability order dˇ for the system [A, B], the noise properties (i.e. the strength |B|S and the conditioning of B), and the controllability properties (i.e. the locations of the singular values and the condition number of the controllability C(A, B)). The following upper bounds are derived from Theorems 2.4 and 2.5 of (Djehiche et al., 2025), while the lower bounds are based on Theorem 3.2 and Corollary 4.1 of the same work:

If A consists solely of a stable part, then for δ ∈ (0, 1) and sample size N satisfying:

(4)

with probability at least 1 − δ, the least squares estimator Aˆ satisfies for the estimation loss:

(5)

the following lower bound for the mean square estimation risk holds:

(6)

If A consists solely of an unstable part, then for δ ∈ (0, 1) and N satisfying:

(7)

with probability at least 1 − δ, the least square estimator satisfies the following bound on the estimation loss:

(8)

and the following lower bound for the mean square estimation risk is given by:

(9)

If A consists of a limit-stable part, then for δ ∈ (0, 1 − e−5) and

(10)

the least squares estimator satisfies the following bound on the estimation loss:

(11)

where

(12)

Furthermore, the following lower bound holds for the mean square estimation risk:

(13)

Empirical validation of these theoretical bounds is explored under varying conditions of B and C(A, B).

This section outlines the methodology for validating theoretical bounds in linear regression for LTI dynamic systems. The methodology includes the systematic generation of the data from a dynamic systems across stability regimes, evaluation of noise and matrix conditioning effects, and validation using simulated and real-world data.

LTI Dynamic systems are modeled using the state evolution equation:

(14)

where A is the system matrix, B is the control matrix, and ϵt is Gaussian noise with zero mean and covariance matrix Ir. To evaluate theoretical bounds under diverse conditions, the eigenvalues of A are constrained to three stability regimes, we avoid mixing eigenvalues from different regimes for the sake of clarity of the results:

  • Stable Systems: Eigenvalues λi of A are sampled uniformly from U(0.1, 0.9), ensuring |λi| < 1. The matrix B is randomly generated and normalized to have a Schatten 2-norm of 1. Well-conditioned B matrices have condition numbers below 10, while ill-conditioned B matrices have condition numbers between 40 and 10,000.

  • Unstable Systems: Eigenvalues λi of A are sampled uniformly from U(1.05, 1.2), ensuring |λi| > 1. The matrix B is normalized similarly to the stable case. Well-conditioned B matrices have condition numbers below 10, and ill-conditioned B matrices have condition numbers between 200 and 500.

  • Limit-Stable Systems: Eigenvalues lie on the unit circle (|λi| = 1), either constructed as λi = e, with θ uniformly spaced between 0 and 2π, or rescaled from an initial A matrix. Well-conditioned B matrices have condition numbers below 10, and ill-conditioned B matrices have condition numbers between 10 and 1,000.

The controllability matrix C(A, B) is constructed iteratively to ensure the (A, B) pair satisfies the controllability criterion. The rank and condition number of C(A, B) are verified, and if the pair does not meet the required conditions, the matrices A and B are regenerated or adjusted through scaling, perturbation, or rank modification.

For each (A, B) pair, system trajectories are simulated over N = 1,000 time steps. The state evolution is governed by the equation xt+1 = Axt + t, with ϵt as Gaussian noise scaled by σ. Two noise levels (σ = 0.1, 1.0) are used to evaluate the impact of noise on estimation accuracy.

Using the least squares algorithm, multiple estimates Aˆ are computed for each simulated trajectory. Two key metrics are evaluated: (1) the Mean Squared Error (MSE) risk E2(Aˆ,A), computed as the square Frobenius norm (Schatten 2-norm) of AAˆ, averaged over 100 trials, and (2) the estimation error L2(Aˆ,A), computed as the Schatten 2-norm of AAˆ for a single trial.

Two primary tests are conducted to validate the theoretical bounds:

  • Estimation Error Decay Rate and Dimension: This test evaluates the error decay rates for different stability regimes:

    • Stable Systems: the predicted decay rate is

(15)
  • Unstable Systems: the theory predicts an exponential decay rate given as

(16)
  • Limit-Stable Systems: the theory gives the same decay rate up to a dimension mismatch indeed the upper bound rate is

The rates above disregard contributions from minor terms and are given for the upper bounds. The rates for the lower bounds are similar with minor changes where instead of taking smin and d1/2|B|S you would get smax and |B|S2, except for limit stable systems where a factor d is missing from the lower bound.

  • Noise and Conditioning Impact: This test investigates the sensitivity of estimation error to noise characteristics and matrix conditioning, the tests are conducted by:

    • Varying the noise variance σ2 with a noise matrix σB of fixed B and varying the condition number cond(B) for a fixed noise variance σ2.

    • Varying the conditioning of the controllability matrix C(A, B) between regimes of well-conditioned and poorly-conditioned.

To assess the practical applicability of the theoretical bounds, validation was performed using real-world datasets. Unlike simulated data, where the system matrices A and B are generated with controlled properties, real-world datasets provide only the observed time series (xi)i. Therefore, the characteristics of A and B must be estimated before proceeding to the estimation and the analysis. Preprocessing steps include normalization, noise filtering, and ensuring compatibility with the modeling framework. The following datasets were used for validation:

3.4.1 Dynamical System Multivariate Time Series Forecast Dataset

This dataset, available on Kaggle [Data], contains multivariate time series data generated from a dynamical system. It is well-suited for testing control theory models, including Linear Time-Invariant (LTI) systems, as it captures complex time-dependent relationships in a controlled environment. The dataset provides a structured setting to validate theoretical bounds under conditions that mimic dynamic system behavior.

3.4.2 Beijing Multi-Site Air-Quality Dataset

This dataset, hosted by the UCI Machine Learning Repository [Data], tracks hourly measurements of air pollutants across multiple locations in Beijing. Here, we focus on the area of Aotizhongxin. The dynamics of air quality, influenced by time-varying processes such as weather conditions and emissions, can be approximated using LTI models. This dataset is particularly valuable for evaluating the robustness of theoretical bounds when applied to noisy, real-world processes.

Validation on these datasets involves estimating the system matrices A and B and comparing the observed estimation errors with the theoretical bounds derived from the controllability and noise structure. This real-world analysis complements the simulated experiments, highlighting the bounds’ applicability across diverse scenarios.

This section evaluates the error and risk decay rates for different system configurations depending on the stability of the matrix A and validates theoretical results that we summarized previously.

4.1.1 Stable systems

We conduct a series of simulations on stable systems with varying dimensions, tracking the estimation error and MSE risk. These results are then compared to the theoretical upper and lower bounds, which are plotted for reference.

  • Theoretical Prediction: According to the previously mentioned theoretical results in (15), the upper and lower estimation risk both agree in their dependence on the sample size N and the dimension d.

  • Empirical Results: This first series of simulation represented in Figure 1 show that indeed the estimated error stays between the predicted upper bound on the estimation loss and the predicted lower bound for the mean square estimation risk.

Figure 1

Comparison of upper and lower bounds for stable system

Figure 1

Comparison of upper and lower bounds for stable system

Close modal

4.1.2 Unstable systems

We conduct a second series of simulation with unstable systems of varying dimensions while tracking the estimation error. Since the estimation error is theorized to decrease exponentially fast in the sample size with the largest eigenvalue, the experiment is made for relatively small samples and the simulated systems do not have eigenvalues deep in the instability region.

  • Theoretical Prediction: According to the theoretical results the upper estimation loss and the lower estimation risk are both of the same order of dependence on the dimension d, and decrease exponentially fast with the sample size. The order for the upper bound depends on the smallest singular value of A and for the lower bound on the largest singular value of A. Up to this mismatch the order of both is roughly as in (16).

  • Empirical Results: The simulations in Figure 2 also confirm the theoretical error rates since the estimated error stays between the predicted upper bound and lower bound. The rate in N is indeed exponential and depends on the spectral properties of the matrix A. This exponential behavior makes the estimation task rather easy, something to be contrasted with the control task since controlling an unstable system is in general an onerous task as the controller has to bring the system into stability first.

Figure 2

Comparison of upper and lower bounds for unstable system

Figure 2

Comparison of upper and lower bounds for unstable system

Close modal

4.1.3 Limit stable systems

We conduct a third series of simulations with limit stable systems of varying dimensions while tracking the estimation error. We recall that there is a mismatch in the dependence on the dimension of the matrix A in the theory between upper and lower bound estimation error estimates, plus an improvement in the dependence on the sample size N over stable systems.

  • Theoretical Prediction: According to the theoretical results the upper estimation loss and the lower estimation risk are both of the same order of dependence on the sample size N up to some lower order logarithmic term, while they differ by a factor of d in the dimension. Indeed the least square estimator satisfies the following bound on the estimation loss:

(17)

and the following lower bound for the mean square estimation risk:

(18)
  • Empirical Results: While the simulations in Figure 3 confirms the theoretical error rates dependence on the sample size N, the dependence on the dimension is not as clear although the empirical results are closer to the upper bounds which suggest that the convergence rate for the error estimate is d/N.

Figure 3

Comparison of upper and lower bounds for limit stable system

Figure 3

Comparison of upper and lower bounds for limit stable system

Close modal

This section presents an empirical study to validate the theoretical upper and lower bounds on estimation accuracy. We evaluate the effects of noise level, structure, and conditioning of B, as well as the controllability of the system matrix C(A, B). For all experiments, the Schatten norm |B|S2=1 is maintained to focus on the influence of noise conditioning and system controllability.

In this study:

  1. The number of observations N = 1,000 (Stable case), N = 100 (Unstable case), N = 1,000 (Limit stable case);

  2. For a square matrix B, a 2 × 2 dimension is used;

  3. For a rectangular matrix B, a 2 × 3 dimension is used and

  4. For the tables related to controllability and noise conditioning impact on estimation accuracy, the noise level is set to σ = 1.

Tables 1, 3, and 5 show results where B structure, noise level σ, and conditioning cond(B) are varied. Estimation error, mean squared estimation risk (MSE risk), and theoretical bounds are reported to measure the impact of noise structure on estimation accuracy.

Table 1

Noise characteristics impact on estimation accuracy and bounds

Dim. Bcond(B)Noise levelMSE riskEst. errUpper bndLower bnd
(2, 2)2.130.10.0600.0380.2950.040
(2, 3)1.830.10.0590.0460.2540.041
(2, 2)6149.420.113.9279.96990.8860.037
(2, 3)7251.180.11.1631.15111.9930.036
(2, 2)2.131.00.0530.0530.2950.040
(2, 3)1.831.00.0630.0810.2540.041
(2, 2)6149.421.03.1643.13424.9770.037
(2, 3)7251.181.01.1491.13413.4360.036

Source(s): Table by authors

Table 3

Noise characteristics impact on estimation accuracy and bounds

Dim. BCond(B)Noise levelMSE riskEst. errUpper bndLower bnd
(2, 2)2.130.10.3420.3480.0295.28 × 10–4
(2, 3)5.260.10.4020.3350.0724.85 × 10–4
(2, 2)7170.760.10.1910.05423.6174.77 × 10–4
(2, 3)6521.420.10.5350.21521.7174.78 × 10–4
(2, 2)2.131.00.2050.3160.0295.28 × 10–4
(2, 3)5.261.00.5090.0780.0724.85 × 10–4
(2, 2)7170.761.00.7552.20323.6174.77 × 10–4
(2, 3)6521.421.01.1110.02921.7174.78 × 10–4

Source(s): Table by authors

Table 5

Noise characteristics impact on estimation accuracy and bounds

Dim. BCond(B)Noise levelMSEEst. errUpper bndLower bnd
(2, 2)2.130.10.0050.0020.0415.52 × 10–4
(2, 3)1.830.10.0030.0010.0885.70 × 10–4
(2, 2)5072.540.10.0390.02452.3975.00 × 10–4
(2, 3)6767.880.10.0110.0021.1665.00 × 10–4
(2, 2)2.131.00.0090.0070.0415.52 × 10–4
(2, 3)1.831.00.0050.0040.0885.70 × 10–4
(2, 2)5072.541.00.0470.01452.3975.00 × 10–4
(2, 3)6767.881.00.0130.0071.1665.00 × 10–4

Source(s): Table by authors

Tables 2, 4, and 6 present results for systems with varying controllability C(A, B) conditions, which are either well-conditioned or poorly-conditioned, in conjunction with different structures of B and its condition number cond(B). The results demonstrate how controllability affects estimation error under these varying conditions.

Table 2

Controllability and noise conditioning impact on estimation accuracy and bounds

Dim. BRank(B)Cond(B)Cond(C(A, B))MSE riskEst. errorUpper bndLower bnd
(2, 2)21.181.120.0450.0190.1780.02734
(2, 2)21.871483.430.0570.0350.2330.03374
(2, 2)2235.508.640.1620.0782.5500.04250
(2, 2)21396.321386.366.5483.78064.6740.04262
(2, 3)22.062.080.0760.0590.2590.03299
(2, 3)11.361.350.0450.0350.2020.02769
(2, 2)162.326.000.1470.0411.4290.03511
(2, 2)110052.751183.641.0700.38218.2000.04262

Source(s): Table by authors

Table 4

Controllability and noise conditioning impact on estimation accuracy and bounds

Dim. BRank(B)Cond(B)Cond(C(A, B))MSE riskEst. errorUpper bndLower bnd
(2, 2)21.181.110.07660.00280.02432.42 × 10–5
(2, 2)22.34131.820.00780.01690.05803.23 × 10–5
(2, 2)233.878.270.17390.0036123.20294.87 × 10–6
(2, 2)225.80109.710.00340.003225.87791.01 × 10–5
(2, 3)22.403.020.00610.01270.06003.27 × 10–5
(2, 3)14.988.270.01520.00320.18764.10 × 10–5
(2, 3)12.68143.650.01230.00300.07023.43 × 10–5
(2, 2)138.778.660.06340.0044297.34033.15 × 10–6
(2, 2)140.13124.090.21970.0021404.26612.75 × 10–6

Source(s): Table by authors

Table 6

Controllability and noise conditioning impact on estimation accuracy and bounds

Dim. BRank(B)Cond(B)Cond(C(A, B))MSE riskEst. errorUpper bndLower bnd
(2, 2)22.132.130.0050.0090.0415.36 × 10–4
(2, 2)23.66787.280.0060.0110.4053.83 × 10–4
(2, 2)2574.63574.630.0090.0030.0309.32 × 10–6
(2, 2)212.769.000.0220.03225.6071.80 × 10–4
(2, 3)21.951.950.0080.0060.1015.70 × 10–4
(2, 3)21.30143.910.0040.0040.0287.84 × 10–4
(2, 3)11.181.180.0080.0080.0168.50 × 10–4
(2, 2)120.8220.820.0670.08342.1831.15 × 10–4

Source(s): Table by authors

4.2.1 Stable case

The theoretical bounds in the table are consistent with the observed errors, particularly for well-conditioned cases, where the bounds are tight. Moreover, the estimation error and the MSE stay close to the lower bound even though the upper bound diverges.

The results show that for stable systems, the error rate stays close to the lower bound regardless of the condition number and the dimension of the matrix B. This stability highlights the robustness of the estimator and validates the theoretical predictions. The result confirms also the observation that the error is invariant to the noise level and is only more sensitive to the noise matrix conditioning.

Indeed, if we take B = σId, we obtain, for the least square estimator,

(19)

which is independent of the noise variance since xiσ=k=0i1Ai1kεk.

Table 2 validates the influence of the controllability matrix C(A, B) on the derived theoretical bounds. For well-conditioned controllability matrices (cond(C(A, B)) < 10), the bounds are tight and align well with the observed MSE risks and estimation errors. However, as cond(C(A, B)) increases, the bounds diverge, particularly for poorly conditioned cases, reflecting the amplified estimation uncertainty predicted by the theory.

The rank and dimension of B have minimal impact on the accuracy as long as C(A, B) is well-conditioned, confirming that the bounds depend primarily on the characteristics of C(A, B). These results emphasize the importance of maintaining well-conditioned controllability matrices to achieve reliable error bounds and accurate estimations.

4.2.2 Unstable case

The theoretical bounds for the unstable case are also consistent with the observed errors and are only affected by the matrices B and C(A, B) conditioning. Nonetheless, we observe that for the well-conditioned case, the experimental results fall outside the theoretical upper and lower bounds both in Tables 3 and 4. This is because the estimation error decreases exponentially fast for unstable matrices, while the theoretical bounds hold for sample sizes that do not allow capturing this phenomenon in the experiment.

Indeed, the theoretical bounds hold for

(20)

which gives theoretical upper and lower bounds that are well below the precision limits. Instead, we report experimental results for N = 100 to keep the error rates reasonable. When, doing experiments with ill-conditioned matrices B and C(A, B) the error rate does not decrease too fast, so the MSE and estimation error both stay between the theoretical upper and lower bounds.

4.2.3 Limit stable case

Similar to the stable case, the theoretical bounds for the limit stable case align well with the observed errors for well-conditioned systems, where the bounds remain tight both for well-conditioned or ill-conditioned matrices B and C(A, B). These results are reported in Tables 5 and 6.

We apply the least squares estimator to two real-world datasets to demonstrate its practical applicability.

This synthetic dataset simulates multivariate time series data from dynamical systems with stable configuration. The dataset consists of multiple variables evolving over time, capturing interactions between components of the system.

5.1.1 Estimation results and analysis

This section presents the results of the system estimation and analysis. The system matrices A and B were estimated from a large dataset after preprocessing, which included differencing for stationarity. Cross-validation and bootstrap techniques were used to evaluate the robustness and confidence in the estimation. The key results are summarized in Table 7. The estimated A-matrix captures the system dynamics, while the B-matrix represents the effect of white noise disturbances. Eigenvalue analysis of A indicates that the system is stable since all eigenvalues lie within the unit circle. Additionally, the mean cross-validation error and a 99% confidence interval for the estimation error were calculated to assess the accuracy of the estimated matrices. A theoretical upper bound for the estimation error was also derived based on the controllability properties of the system.

Table 7

Summary of estimation results

ParameterValue
Estimated A Matrix2.23×1064.16×1034.45×1041.80×1044.77×1037.30×1037.45×1021.87×1021.35×1033.04×1038.99×1011.43×1021.03×1041.40×1032.93×1029.81×101
Estimated B Matrix1.18×1035.87×1031.60×1035.13×1033.76×1032.52×1031.64×1032.80×1052.37×1035.66×1049.85×1041.00×104
Eigenvalues of A0.00378991 + 0.00243652i, 0.00378991–0.00243652i, 0.90487607, 0.97503101
System StabilityStable (all eigenvalues within the unit circle)
Mean Cross-Validation Estimation Error0.0062 (Frobenius norm)
99% Confidence Interval for Estimation ErrorLower Bound: 0.0049, Upper Bound: 0.0161
Theoretical Upper Bound for Estimation Error0.352
Theoretical Lower Bound for Estimation Risk0.0047
Empirical Estimation risk0.6773

Source(s): Table by authors

5.1.2 Bootstrap analysis and visualization

The accuracy and robustness of the estimated matrices were further validated using bootstrap analysis. Figure 4 shows the distribution of bootstrap estimation errors along with the mean cross-validation error and the 99% confidence interval bounds. The histogram indicates that the errors are tightly clustered, demonstrating the reliability of the estimation.

Figure 4

Distribution of bootstrap estimation errors with confidence interval

Figure 4

Distribution of bootstrap estimation errors with confidence interval

Close modal

5.1.3 Discussion

  • Stability Analysis: The eigenvalues of the estimated A-matrix lie within the unit circle, confirming the stability of the identified system dynamics. This stability is critical for ensuring the validity of the theoretical bounds and the applicability of their rates in practice. In this case, we would expect a decay rate of (d/N)1/2 and adjust the sample size accordingly.

  • Cross-Validation and Bootstrap Analysis: The mean cross-validation error is remarkably low, with a value of 0.00158, and the 99% confidence interval spans a narrow range from 0.0009 to 0.0066. Bootstrap analysis further corroborates the reliability of the estimation, demonstrating that errors are tightly clustered and well within the confidence bounds.

  • Conservative Theoretical Upper Bound: The derived theoretical upper bound of 0.43631 for the estimation error is higher than the observed errors. This discrepancy highlights the conservativeness of the theoretical bound, providing assurance that practical applications are likely to perform even better than theoretically anticipated.

  • Estimation Effectiveness: The low estimation errors observed across various analyses underscore the effectiveness of the estimation process. These results emphasize the method’s ability to capture system dynamics accurately, ensuring reliable forecasting performance for multivariate time series data.

The dataset, titled “Beijing Multi-Site Air-Quality Data”, includes hourly measurements of air pollutants such as PM2.5, PM10, SO2, NO2, CO, and O3 from 12 nationally controlled monitoring sites in Beijing. In this study, we use the data related the area of Aotizhongxin in Beijin. The data spans from March 1, 2013, to February 28, 2017, and exhibits both seasonal patterns and long-term trends influenced by meteorological and anthropogenic factors.

5.2.1 Estimation results and analysis

As in the previous subsection, we present the results of estimating the system matrices A and B from the dataset. The key results are summarized in Table 8. Preprocessing steps included differencing for stationarity and Z-score normalization. The same methods were used to evaluate the robustness and accuracy of the estimation.

Table 8

Summary of estimation results for Beijing air pollution dataset

ParameterValue
Estimated A Matrix0.08240.06810.03550.05990.02630.02660.17850.04520.03280.02170.01030.01180.09940.03720.06180.01370.03560.07490.20920.02550.00210.00340.03670.01650.0208
Estimated B Matrix0.00190.00530.00680.00490.00410.01380.00020.00080.00160.00110.00160.01070.00470.00200.00020.01340.00610.00340.00770.0022
Eigenvalues of A0.0434, 0.0565 ± 0.0339i, 0.1969, 0.2369
System StabilityStable (all eigenvalues within the unit circle)
Mean Cross-Validation Estimation Error0.0339 (Frobenius norm)
99% Confidence Interval for Estimation ErrorLower Bound: 0.0321, Upper Bound: 0.1168
Theoretical Upper Bound for Estimation Error2.2927
Theoretical Lower Bound for MSE Risk0.0857
Empirical MSE Risk0.2131
Controllability Matrix Rank5 (Full Rank)
Condition Number of Controllability Matrix11.92
Smallest Singular Value of Controllability Matrix0.0017

Source(s): Table by authors

5.2.2 Bootstrap analysis and visualization

Bootstrap analysis was conducted to validate the robustness of the estimation process. Figure 5 displays the distribution of bootstrap estimation errors, highlighting that the errors are tightly clustered around the mean, indicating reliable estimation results.

Figure 5

Distribution of bootstrap estimation errors for Beijing dataset

Figure 5

Distribution of bootstrap estimation errors for Beijing dataset

Close modal

5.2.3 Discussion

  • System Stability: The estimated system is confirmed to be stable, as all eigenvalues of the A-matrix lie within the unit circle. This stability analysis is critical for ensuring reliable and predictable system dynamics estimates. It allows us to adjust N the sample size to achieve the desired accuracy using the theoretical upper bounds for guidance.

  • Low Estimation Errors: Both cross-validation and bootstrap analyses demonstrate low estimation errors, with the mean Frobenius norm of the error recorded at 0.0339. These results indicate the high accuracy of the parameter estimation process.

  • Conservative Theoretical Bound: The theoretical upper bound for the estimation error, calculated as 2.2927, is consistent with the observed results. However, this bound is conservative when compared to the empirical risk measures, providing an additional margin of confidence in practical applications.

  • Controllability Metrics: The system exhibits strong controllability characteristics, supported by a full-rank controllability matrix and acceptable singular values. These metrics validate the system’s well-conditioned nature and enhance the reliability of the estimated parameters.

This study highlights the utility of non-asymptotic bounds in improving econometric modeling and financial system identification. By understanding the finite-time behavior of least squares estimators, practitioners in finance and economics can better handle noise and dimensionality challenges, enabling more robust forecasting and decision-making models. The following key insights are drawn from the analysis:

Theoretical Implications: The results confirm that error decay rates align closely with theoretical predictions under varying stability regimes. For stable systems, the error decreases proportionally to O(d1/2/N1/2), consistent with derived upper and lower bounds. In unstable systems, the exponential decay rate observed is governed by the spectral properties of the system matrix, highlighting the interplay between eigenvalues and sample size. Limit-stable systems exhibit an improved convergence rate, albeit with a mismatch in dimensional dependence between theoretical bounds. These findings reinforce the robustness of theoretical frameworks in capturing the finite-time behavior of least squares estimators across different configurations.

Practical Implications: Noise characteristics and system dimensionality significantly impact estimation accuracy in practical scenarios. Sensitivity tests revealed that estimation errors increase with higher noise levels and poorly conditioned matrices. Real-world validation on the Beijing Air Quality dataset demonstrated that empirical estimation errors are consistently below conservative theoretical upper bounds, confirming the practical utility of these bounds. Moreover, system stability, verified through eigenvalue analysis, emerged as a critical factor for ensuring reliable estimation accuracy in dynamic environments since it allows us to adjust the sample size.

In conclusion, this work bridges theoretical and empirical analyses, demonstrating the efficacy of least squares estimators in diverse scenarios. While confirming theoretical bounds, the study also highlights practical uses of the theoretical upper and lower bounds for controlling the accuracy by estimating the system complexity parameters and adjusting the sample size accordingly. By bridging theoretical insights and practical validations, this study provides valuable tools for modeling economic systems, such as analyzing the stability of macroeconomic policies, and financial systems, like assessing the impact of shocks on asset prices. Future work can extend this research by deriving similar bounds to high dimensional nonlinear systems and applying them to more complex data observed in global financial markets and economic interactions.

This work was supported in part by the NYUAD Center for Interacting Urban Networks (CITIES), funded by Tamkeen under the NYUAD Research Institute Award CG001 and in part by the NYUAD Research Center on Stability, Instability and Turbulence (SITE), funded by Tamkeen under the NYUAD Research Institute Award CG002.

Arora
,
S.
,
Hazan
,
E.
,
Lee
,
H.
,
Singh
,
K.
,
Zhang
,
C.
and
Zhang
,
Y.
(
2018
), “
Towards provable control for unknown linear dynamical systems
”,
available at:
 https://openreview.net/forum?id=BygpQlbA-
Djehiche
,
B.
and
Mazhar
,
O.
(
2021
), “
Non-asymptotic estimation lower bounds for LTI state space models with Cramér-Rao and van Trees
”,
arXiv preprint
,
arXiv:2109.08582, available at:
 https://arxiv.org/abs/2109.08582
Djehiche
,
B.
and
Mazhar
,
O.
(
2022
), “
Efficient learning of hidden state LTI state space models of unknown order
”,
arXiv preprint
,
arXiv:2202.01625, available at:
 https://arxiv.org/abs/2202.01625
Djehiche
,
B.
and
Mazhar
,
O.
(
2025
), “An optimal non-asymptotic journey through system identification with least squares”, in
Thach
,
N.N.
,
Trung
,
N.D.
,
Ha
,
D.T.
and
Kreinovich
,
V.
(Eds),
Artificial Intelligence and Machine Learning for Econometrics: Applications, Regulation (and Related Topics)
,
Springer
,
Studies in Systems, Decision, and Control series
.
Jedra
,
Y.
and
Proutière
,
A.
(
2019
), “
Sample complexity lower bounds for linear system identification
”,
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS 2019)
, pp. 
2397
-
2405
.
Ljung
,
L.
(
1976a
), “
Consistency of the least-squares identification method
”,
IEEE Transactions on Automatic Control
, Vol. 
21
No. 
5
, pp. 
779
-
781
, doi: .
Ljung
,
L.
(
1976
b), “On the consistency of prediction error identification methods”, in
Mehra
,
R.K.
and
Lainiotis
,
D.G.
(Eds),
Mathematics in Science and Engineering
,
Elsevier
, Vol. 
126
, pp. 
121
-
164
, doi: .
Ljung
,
L.
(
1986
),
System Identification: Theory for the User
,
Prentice-Hall
,
[PubMed]
.
Oymak
,
S.
and
Ozay
,
N.
(
2019
), “
Non-asymptotic identification of LTI systems from a single trajectory
”,
IEEE Transactions on Automatic Control
, Vol. 
64
No. 
6
, pp. 
2345
-
2355
.
Simchowitz
,
M.
,
Mania
,
H.
,
Tu
,
S.
,
Jordan
,
M.I.
and
Recht
,
B.
(
2018
), “
Learning without mixing: towards a sharp analysis of linear system identification
”,
Proceedings of the 31st Conference on Learning Theory (COLT 2018)
, pp. 
1298
-
1320
.
Published in Asian Journal of Economics and Banking. Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at http://creativecommons.org/licences/by/4.0/legalcode

or Create an Account

Close Modal
Close Modal