Compressible signals¶
In this section, we first look at some general results and definitions related to \(K\)-term approximations of arbitrary signals \(x \in \CC^N\). We then define the notion of a compressible signal and study properties related to it.
K-term approximation of general signals¶
Let \(x \in \CC^N\). Let \(T \subset \{ 1, 2, \dots, N\}\) be any index set.Further let
such that
Let \(x_T \in \CC^{|T|}\) be defined as
Then \(x_T\) is a restriction of the signal \(x\) on the index set \(T\).
Alternatively let \(x_T \in \CC^N\) be defined as
In other words, \(x_T \in \CC^N\) keeps the entries in \(x\) indexed by \(T\) while sets all other entries to 0. Then we say that \(x_T\) is obtained by masking \(x\) with \(T\). As an abuse of notation, we will use any of the two definitions whenever we are referring to \(x_T\). The definition being used should be obvious from the context.
Let
Let
Then
Since \(|T| = 4\), sometimes we will also write
Let \(x \in \CC^N\) be an arbitrary signal. Consider any index set \(T \subset \{1, \dots, N \}\) with \(|T| = K\). Then \(x_T\) is a \(K\)-term approximation of \(x\).
Clearly for any \(x \in \CC^N\) there are \(\binom{N}{K}\) possible \(K\)-term approximations of \(x\).
Let
Let \(T= \{ 1, 6 \}\). Then
is a \(2\)-term approximation of \(x\).
If we choose \(T= \{7,8,9,10\}\), the corresponding \(4\)-term approximation of \(x\) is
Let \(x \in \CC^N\) be an arbitrary signal. Let \(\lambda_1, \dots, \lambda_N\) be indices of entries in \(x\) such that
In case of ties, the order is resolved lexicographically, i.e. if \(|x_i| = |x_j|\) and \(i < j\) then \(i\) will appear first in the sequence \(\lambda_k\).
Consider the index set \(\Lambda_K = \{ \lambda_1, \lambda_2, \dots, \lambda_K\}\). The restriction of \(x\) on \(\Lambda_K\) given by \(x_{\Lambda_K}\) (see above) contains the \(K\) largest entries \(x\) while setting all other entries to 0. This is known as the \(K\) largest entries approximation of \(x\).
This signal is denoted henceforth as \(x|_K\). i.e.
where \(\Lambda_K\) is the index set corresponding to \(K\) largest entries in \(x\) (magnitude wise).
Let
Then
All further \(K\) largest entries approximations are same as \(x\).
A pertinent question at this point is: which \(K\)-term approximation of \(x\) is the best \(K\)-term approximation? Certainly in order to compare two approximations we need some criterion. Let us choose \(l_p\) norm as the criterion. The next lemma gives an interesting result for best \(K\)-term approximations in \(l_p\) norm sense.
Let \(x \in \CC^N\). Let the best \(K\) term approximation of \(x\) be obtained by the following optimization program:
where \(p \in [1, \infty]\).
Let an optimal solution for this optimization problem be denoted by \(x_{T^*}\). Then
i.e. the \(K\)-largest entries approximation of \(x\) is an optimal solution to (3) .
For \(p=\infty\), the result is obvious. In the following, we focus on \(p \in [1, \infty)\).
We note that maximizing \(\| x_T \|_p\) is equivalent to maximizing \(\| x_T \|^p_p\).
Let \(\lambda_1, \dots, \lambda_N\) be indices of entries in \(x\) such that
Further let \(\{ \omega_1, \dots, \omega_N\}\) be any permutation of \(\{1, \dots, N \}\).
Clearly
Thus if \(T^*\) corresponds to an optimal solution of (3) then
Thus \(x|_K\) is an optimal solution to (3) .
This lemma helps us establish that whenever we are looking for a best \(K\)-term approximation of \(x\) under any \(l_p\) norm, all we have to do is to pickup the \(K\)-largest entries in \(x\).
Let \(\Phi \in \CC^{M \times N}\). Let \(T \subset \{ 1, 2, \dots, N\}\) be any index set.Further let
such that
Let \(\Phi_T \in \CC^{M \times |T|}\) be defined as
Then \(\Phi_T\) is a restriction of the matrix \(\Phi\) on the index set \(T\).
Alternatively let \(\Phi_T \in \CC^{M \times N}\) be defined as
In other words, \(\Phi_T \in \CC^{M \times N}\) keeps the columns in \(\Phi\) indexed by \(T\) while sets all other columns to 0. Then we say that \(\Phi_T\) is obtained by masking \(\Phi\) with \(T\). As an abuse of notation, we will use any of the two definitions whenever we are referring to \(\Phi_T\). The definition being used should be obvious from the context.
Let \(\supp(x) = \Lambda\). Then
Let \(S\) and \(T\) be two disjoint index sets such that for some \(x \in \CC^N\)
using the mask version of \(x_T\) notation. Then the following holds
Straightforward application of previous result:
Let \(T\) be any index set. Let \(\Phi \in \CC^{M \times N}\) and \(y \in \CC^M\). Then
Now let
Then
Compressible signals¶
We will now define the notion of a compressible signal in terms of the decay rate of magnitude of its entries when sorted in descending order.
Let \(x \in \CC^N\) be an arbitrary signal. Let \(\lambda_1, \dots, \lambda_N\) be indices of entries in \(x\) such that
In case of ties, the order is resolved lexicographically, i.e. if \(|x_i| = |x_j|\) and \(i < j\) then \(i\) will appear first in the sequence \(\lambda_k\). Define
The signal \(x\) is called \(p\)-compressible with magnitude \(R\) if there exists \(p \in (0, 1]\) such that
Let \(x\) be be \(p\)-compressible with \(p=1\). Then
Recalling \(\widehat{x}\) from (6) it’s straightforward to see that
since the \(l_1\) norm doesn’t depend on the ordering of entries in \(x\).
Now since \(x\) is \(1\)-compressible, hence from (7) we have
This gives us
The sum on the R.H.S. is the \(N\)-th Harmonic number (sum of reciprocals of first \(N\) natural numbers). A simple upper bound on Harmonic numbers is
This completes the proof.
We now demonstrate how a compressible signal is well approximated by a sparse signal.
Let \(x\) be a \(p\)-compressible signal and let \(x|_K\) be its best \(K\)-term approximation. Then the \(l_1\) norm of approximation error satisfies
with
Moreover the \(l_2\) norm of approximation error satisfies
with
We now approximate the R.H.S. sum with an integral.
Now
We can similarly show the result for \(l_2\) norm.