8

Suppose I have a matrix

$$A= \begin{pmatrix} 1 & 0 & 1 & 0\\ 0 & 1 & 1& 0\\ \end{pmatrix} $$

where the variables represent channel information, say $X_1$, $X_2$, $X_3$ and $X_4$. Consider the typical least-squares problem

$$Y= AX+b$$

However, matrix $A$ is fat and, thus, $A^{T} A$ is not invertible. I cannot recover $X_3$ or $X_4$; they come out as zero. I can use regularization and optimize a joint least-squares and least-norm problem, but that does not give me the same $X_1$ and $X_2$. Is there any way to solve this problem?

I tried to stack all the received vectors into a matrix $Y$ and do the same for $A$ and $b$ to make matrix $A$ tall. However, due to the structure of $A$, $A^{T} A$ is still not invertible. Does anyone know how to solve this problem? Can I recover the solution for the parameters from the regularized solution?

Royi
user59419

6 Answers

2

It means the solution is non-unique, i.e., there is redundancy. In your formulation, $X_4$ is completely free and $X_3$ acts as an offset parameter. You can deflate your matrix and keep only the full-rank part to eliminate the unconstrained directions.

In your example, using your $Y = AX + b$ model, the solutions form the family

$$ \begin{bmatrix} X_1\\X_2 \end{bmatrix} = (Y - b) - \begin{bmatrix} X_3\\X_3\end{bmatrix}$$

You have a degenerate case in which one of the variables is completely free, but similar problems are quite common, e.g. in robotics with overactuated manipulators, where you optimize over the set of all possible solutions.
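For illustration, a minimal MATLAB sketch of that deflation idea (the $Y$ and $b$ values are made up; column-pivoted QR is one way to pick an independent column subset):

A = [1 0 1 0; 0 1 1 0];
Y = [3; 2];                   % made-up observation
b = [1; 1];                   % made-up offset

r = rank(A);                  % r = 2 here
[~, ~, p] = qr(A, 'vector');  % column-pivoted QR; p(1:r) indexes
Ar = A(:, p(1:r));            %   a linearly independent column subset
Xr = Ar \ (Y - b);            % unique solution for those coordinates;
                              %   the remaining coordinates stay free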

percusse
1

Is there any way to solve this problem?

No, not really. Having such an ill-formed (rank-deficient) observation matrix simply means that you don't have more information. You cannot spawn information from nowhere.

Marcus Müller
  • The Least Squares solution (Which is also the Least Norm Solution) always exists for any Linear Equation System. – Royi Aug 01 '17 at 06:02
1

As can be observed, even if $b=0$, $A$ is a non-square matrix, so its inverse cannot exist.

Observe also that $A^{T} A$ is not invertible, so the left pseudo-inverse $(A^{T} A)^{-1} A^{T}$ does not exist, because the columns of $A$ are not linearly independent.

The only way the problem can be addressed is to introduce additional conditions on the signal, such as sparsity. If $X$ has two or fewer non-zero coefficients, then there is a possible solution; a brute-force sketch is below.
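For instance, a brute-force MATLAB sketch that searches over all supports of size at most 2 (fine here since there are only 4 unknowns; the right-hand side is made up):

A = [1 0 1 0; 0 1 1 0];
y = [1; 1];                          % made-up right-hand side

bestX   = zeros(4, 1);               % all-zero candidate
bestErr = norm(y);
for k = 1:2                          % allow at most 2 non-zeros
    for s = nchoosek(1:4, k).'       % each column is a candidate support
        x    = zeros(4, 1);
        x(s) = pinv(A(:, s)) * y;    % least squares on the support
        if norm(A * x - y) < bestErr
            bestErr = norm(A * x - y);
            bestX   = x;
        end
    end
end
bestX                                % here: [0; 0; 1; 0]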

Peter K.
AnVij
1

The Least Squares solution is always well defined for a Linear System of Equations.

In your case, which is underdetermined, there are many solutions to the Linear Equations.
The Least Squares solution has a nice property: among all solutions it also minimizes the $ {L}_{2} $ norm (the Least Norm Solution), hence it is well defined.

In practice, in order to obtain this solution you need to use the pinv() function in MATLAB.
This function uses the Moore-Penrose Pseudo Inverse Matrix of $ A $ to solve the equation:

x = pinv(A) * b

It basically decomposes the matrix using the Singular Value Decomposition (SVD) such that $ U S {V}^{T} = A $ and then rebuilds the matrix $ {A}^{\dagger} = V {S}^{\dagger} {U}^{T} $, where $ {S}^{\dagger} $ is obtained by transposing the diagonal matrix $ S $ and inverting each of its non-zero elements.
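To see this concretely, here is a small sketch that rebuilds pinv(A) from the SVD by hand (the tolerance mimics MATLAB's default; variable names are illustrative):

A = [1 0 1 0; 0 1 1 0];
[U, S, V] = svd(A);                   % A = U * S * V'
tol  = max(size(A)) * eps(norm(A));   % threshold for "non-zero" singular values
Sdag = zeros(size(S'));               % S is 2 x 4, so S' is 4 x 2
idx  = find(diag(S) > tol);
Sdag(idx, idx) = diag(1 ./ diag(S(idx, idx)));
Adag = V * Sdag * U';                 % Moore-Penrose pseudo-inverse
norm(Adag - pinv(A))                  % ~0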

Update

The user @percusse said the following:

You just need A\b

You shouldn't use pinv for linear system solutions that's what I mean. Not only inefficient but also not accurate. There is no unique solution hence you can't talk about a least.

Backslash operator is optimized for all possible linear eq sets. Try it.

There is no least squares solutions for underdetermined systems because of the nontrivial nullspace. Anyways nevermind.

So, to make things clear:
The Least Squares solution always exists, and it is also the Least Norm solution.
Namely, from the set of minimizers $ \mathcal{B} = \arg \min_{x} \left\| A x - b \right\|_{2} $ there is one unique $ \hat{x} = \arg \min_{x \in \mathcal{B}} \left\| x \right\|_{2} $.
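A short sketch of the argument (using the SVD notation above): any minimizer of $ \left\| A x - b \right\|_{2} $ can be written as $ \hat{x} + n $, where $ \hat{x} = {A}^{\dagger} b $ and $ n $ lies in the Null Space of $ A $. Since $ \hat{x} $ lies in the Row Space of $ A $, which is orthogonal to the Null Space,

$$ \left\| \hat{x} + n \right\|_{2}^{2} = \left\| \hat{x} \right\|_{2}^{2} + \left\| n \right\|_{2}^{2} \geq \left\| \hat{x} \right\|_{2}^{2} $$

with equality only for $ n = 0 $. Hence $ \hat{x} $ is the unique Least Norm solution among all Least Squares minimizers.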

For instance:

numRows = 3;
numCols = 27; %<! Must be bigger than 'numRows' for an Under Determined System

for ii = 1:1e12
    mA   = randn([numRows, numCols]);
    vB   = randn([numRows, 1]);
    vXLs = pinv(mA) * vB; % Least Norm (Least Squares) solution
    vX   = mA \ vB;       % Basic solution returned by the backslash operator

    % Halt if the pinv() solution is ever not strictly smaller in norm
    if(norm(vXLs) >= norm(vX))
        keyboard();
    end

    if(mod(ii, 1e4) == 0)
        disp(['Iteration Number - ', num2str(ii)]);
    end
end

Remarks:

  • The difference arises for Under Determined Systems. For over- / well-determined systems (namely Tall Matrices, which are the "classic" case in LS problems) the two solutions coincide.
  • The Pseudo Inverse is much slower to use than the \ operator. If you're looking for any solution, use \. If you're looking for the Least Squares solution (the Least Squares solution is always the Least Norm solution) of an Under Determined System, you need to use the pinv() function.
  • Proof and deeper analysis can be found in my Singular Value Decomposition (SVD) Presentation. Specifically this issue is on pages 50-60.
  • Cleve Moler's (Inventor of MATLAB) post on Pseudo Inverse vs. \ - The World’s Simplest Impossible Problem.
Royi
  • You just need A\b – percusse Aug 01 '17 at 07:53
  • @percusse, No. That's wrong. This is not the Least Norm solution (At least not guaranteed to be) and hence not the Least Squares solution. This is a common mistake people do. – Royi Aug 01 '17 at 15:26
  • You shouldn't use pinv for linear system solutions that's what I mean. Not only inefficient but also not accurate. There is no unique solution hence you can't talk about a least. – percusse Aug 01 '17 at 15:33
  • In the case you look for LS solution in under determined system this is the only way to go. This is what the OP asked for. Of course that for full rank system you should use \. – Royi Aug 01 '17 at 15:38
  • Backslash operator is optimized for all possible linear eq sets. Try it. – percusse Aug 01 '17 at 15:44
  • @percusse, You're just wrong. For under determined systems it won't yield the least squares solution. You should try it :-). – Royi Aug 01 '17 at 15:45
  • There is no least squares solutions for underdetermined systems because of the nontrivial nullspace. Anyways nevermind. – percusse Aug 01 '17 at 15:47
  • @percusse, That's also not true. The LS solution is the one which has the least norm of all solutions (Infinite set). It is unique and always exists. Namely, from the set of all solutions, there is one with the least norm - This is the LS Solution. It's easy to prove if you know the SVD. – Royi Aug 01 '17 at 15:51
  • If you say so, but don't come back and continue arguing if I run your code in a for loop for 100 times and half of the time the result is a logical 0 :) – percusse Aug 01 '17 at 16:19
  • @percusse, If you give me one (Where mA is defined as above) we'll both be written in history :-) (Well, this is the same as choosing exactly 0.93 if you have random number generator which is uniform on [0, 1], you basically choose one specific vector from the Null Space of $ A $, You gamble the \ operator to do so). Again, it is easy to prove using the SVD properties. If I have time later I will post the proof. – Royi Aug 01 '17 at 16:44
  • If you can just stop for a second and read what I'm saying you'll see the problem in the whole discussion but as you wish. Just put your code in a loop and run it a few times. – percusse Aug 01 '17 at 17:01
  • @percusse, I ran the above code (Updated for stopping if the condition is wrong). I'm on 2e6 and still no keyboard. Again, I will post the proof to show you the Least Squares is unique and must obey least norm. – Royi Aug 01 '17 at 17:49
  • You are almost there :) – percusse Aug 01 '17 at 17:58
  • I will let you stress your electricity bill to prove the opposite. Enjoy... – Royi Aug 01 '17 at 18:04
1

Since this question has an information-theoretic flavor, let us work over the finite field $\mathbb F_2$. Rephrasing:

Given $$\mathrm A = \begin{bmatrix} 1 & 0 & 1 & 0\\ 0 & 1 & 1 & 0\end{bmatrix}$$ and vector $\mathrm y \in \mathbb F_2^2$, find a vector $\mathrm x \in \mathbb F_2^4$ of minimal Hamming weight such that $\rm A x = y$.


Example

Suppose we are given

$$\mathrm y = \begin{bmatrix} 1\\ 1\end{bmatrix}$$

which is the 3rd column of $\rm A$ and also the sum of the 1st and 2nd columns. Restricting to $x_4 = 0$ (the 4th column of $\rm A$ is zero, so a non-zero $x_4$ only adds weight), the solution set then has only two elements

$$\mathrm x \in \left\{ \begin{bmatrix} 0\\ 0\\ 1\\ 0\end{bmatrix}, \begin{bmatrix} 1\\ 1\\ 0\\ 0\end{bmatrix} \right\}$$

The former has Hamming weight $1$, while the latter has Hamming weight $2$.


Fortunately, matrix $\rm A$ is already in reduced row echelon form (RREF)

$$\rm A = \begin{bmatrix} \mathrm I_2 & \mathrm F\end{bmatrix}$$

and, thus, the solution set of the linear system $\rm A x = y$ is parameterized as follows

$$\mathrm x \in \left\{ \begin{bmatrix} \mathrm y\\ 0_2\end{bmatrix} + \begin{bmatrix} \mathrm F\\ \mathrm I_2\end{bmatrix} \mathrm z : \mathrm z \in \mathbb F_2^2 \right\}$$

Only $x_4$ depends on $z_2$. In order to avoid increasing the Hamming weight of $\mathrm x$ unnecessarily, let us choose $x_4 = z_2 = 0$. The solution set of the linear system $\rm A x = y$ is given by

$$\mathrm x \in \left\{ \begin{bmatrix} y_1\\ y_2\\ 0\\ 0\end{bmatrix} + z_1 \begin{bmatrix} 1\\ 1\\ 1\\ 0\end{bmatrix} : z_1 \in \mathbb F_2 \right\} = \left\{ \begin{bmatrix} y_1\\ y_2\\ 0\\ 0\end{bmatrix}, \begin{bmatrix} y_1+1\\ y_2+1\\ 1\\ 0\end{bmatrix} \right\} =: \mathcal X (\mathrm y)$$

Thus, the solution of minimal Hamming weight is

$$\mathrm x^* := \arg \min_{\mathrm x \in \mathcal X (\mathrm y)} w (\mathrm x)$$

where function $w : \mathbb F_2^4 \to \mathbb N$ returns the Hamming weight of its input. To summarize, we find the optimal solution by comparing the Hamming weights of two $4$-dimensional vectors. That is all!
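For concreteness, a tiny brute-force check over all $16$ vectors of $\mathbb F_2^4$ (plain MATLAB with mod-$2$ arithmetic; the variable names are illustrative):

A = [1 0 1 0; 0 1 1 0];
y = [1; 1];

best = []; bestW = inf;
for k = 0:15
    x = double(dec2bin(k, 4).' == '1');  % enumerate all of F_2^4
    if isequal(mod(A * x, 2), y)
        w = sum(x);                      % Hamming weight
        if w < bestW, bestW = w; best = x; end
    end
end
best.'                                   % minimal-weight solution: [0 0 1 0]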

0

The given matrix $A$ is of rank 2: the third and fourth columns are linear combinations of the first and second columns (the fourth column is, in fact, zero). So if the data matrix $X$ is of rank less than or equal to the rank of $A$ (i.e. 2), you can recover the matrix $X$.

To know more about problems like this one, please read up on the Orthogonal Matching Pursuit (OMP) and Compressive Sampling Matching Pursuit (CoSaMP) algorithms; a bare-bones OMP sketch follows.
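For orientation, here is a bare-bones OMP sketch in MATLAB (an illustrative toy, not a library routine; it assumes the columns of A are roughly normalized and K is the target sparsity):

function x = ompSketch(A, y, K)
    r = y;                            % residual
    S = [];                           % current support
    x = zeros(size(A, 2), 1);
    for k = 1:K
        [~, j] = max(abs(A.' * r));   % column most correlated with residual
        S  = union(S, j);
        xS = A(:, S) \ y;             % least squares on the current support
        r  = y - A(:, S) * xS;        % update residual
    end
    x(S) = xS;
end

For example, ompSketch([1 0 1 0; 0 1 1 0], [1; 1], 1) returns [0; 0; 1; 0], picking the single column most correlated with the observation.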