how does multiplication differ for NumPy Matrix vs Array classes?

python arrays numpy matrix matrix-multiplication

245,749

Solution 1

In 3.5, Python finally got a matrix multiplication operator. The syntax is a @ b.

Solution 2

The main reason to avoid using the matrix class is that a) it's inherently 2-dimensional, and b) there's additional overhead compared to a "normal" numpy array. If all you're doing is linear algebra, then by all means, feel free to use the matrix class... Personally I find it more trouble than it's worth, though.

For arrays (prior to Python 3.5), use dot instead of matrixmultiply.

E.g.

import numpy as np
x = np.arange(9).reshape((3,3))
y = np.arange(3)

print np.dot(x,y)

Or in newer versions of numpy, simply use x.dot(y)

Personally, I find it much more readable than the * operator implying matrix multiplication...

For arrays in Python 3.5, use x @ y.

Solution 3

the key things to know for operations on NumPy arrays versus operations on NumPy matrices are:

NumPy matrix is a subclass of NumPy array
NumPy array operations are element-wise (once broadcasting is accounted for)
NumPy matrix operations follow the ordinary rules of linear algebra

some code snippets to illustrate:

>>> from numpy import linalg as LA
>>> import numpy as NP

>>> a1 = NP.matrix("4 3 5; 6 7 8; 1 3 13; 7 21 9")
>>> a1
matrix([[ 4,  3,  5],
        [ 6,  7,  8],
        [ 1,  3, 13],
        [ 7, 21,  9]])

>>> a2 = NP.matrix("7 8 15; 5 3 11; 7 4 9; 6 15 4")
>>> a2
matrix([[ 7,  8, 15],
        [ 5,  3, 11],
        [ 7,  4,  9],
        [ 6, 15,  4]])

>>> a1.shape
(4, 3)

>>> a2.shape
(4, 3)

>>> a2t = a2.T
>>> a2t.shape
(3, 4)

>>> a1 * a2t         # same as NP.dot(a1, a2t) 
matrix([[127,  84,  85,  89],
        [218, 139, 142, 173],
        [226, 157, 136, 103],
        [352, 197, 214, 393]])

but this operations fails if these two NumPy matrices are converted to arrays:

>>> a1 = NP.array(a1)
>>> a2t = NP.array(a2t)

>>> a1 * a2t
Traceback (most recent call last):
   File "<pyshell#277>", line 1, in <module>
   a1 * a2t
   ValueError: operands could not be broadcast together with shapes (4,3) (3,4)

though using the NP.dot syntax works with arrays; this operations works like matrix multiplication:

>> NP.dot(a1, a2t)
array([[127,  84,  85,  89],
       [218, 139, 142, 173],
       [226, 157, 136, 103],
       [352, 197, 214, 393]])

so do you ever need a NumPy matrix? ie, will a NumPy array suffice for linear algebra computation (provided you know the correct syntax, ie, NP.dot)?

the rule seems to be that if the arguments (arrays) have shapes (m x n) compatible with the a given linear algebra operation, then you are ok, otherwise, NumPy throws.

the only exception i have come across (there are likely others) is calculating matrix inverse.

below are snippets in which i have called a pure linear algebra operation (in fact, from Numpy's Linear Algebra module) and passed in a NumPy array

determinant of an array:

>>> m = NP.random.randint(0, 10, 16).reshape(4, 4)
>>> m
array([[6, 2, 5, 2],
       [8, 5, 1, 6],
       [5, 9, 7, 5],
       [0, 5, 6, 7]])

>>> type(m)
<type 'numpy.ndarray'>

>>> md = LA.det(m)
>>> md
1772.9999999999995

eigenvectors/eigenvalue pairs:

>>> LA.eig(m)
(array([ 19.703+0.j   ,   0.097+4.198j,   0.097-4.198j,   5.103+0.j   ]), 
array([[-0.374+0.j   , -0.091+0.278j, -0.091-0.278j, -0.574+0.j   ],
       [-0.446+0.j   ,  0.671+0.j   ,  0.671+0.j   , -0.084+0.j   ],
       [-0.654+0.j   , -0.239-0.476j, -0.239+0.476j, -0.181+0.j   ],
       [-0.484+0.j   , -0.387+0.178j, -0.387-0.178j,  0.794+0.j   ]]))

matrix norm:

>>>> LA.norm(m)
22.0227

qr factorization:

>>> LA.qr(a1)
(array([[ 0.5,  0.5,  0.5],
        [ 0.5,  0.5, -0.5],
        [ 0.5, -0.5,  0.5],
        [ 0.5, -0.5, -0.5]]), 
 array([[ 6.,  6.,  6.],
        [ 0.,  0.,  0.],
        [ 0.,  0.,  0.]]))

matrix rank:

>>> m = NP.random.rand(40).reshape(8, 5)
>>> m
array([[ 0.545,  0.459,  0.601,  0.34 ,  0.778],
       [ 0.799,  0.047,  0.699,  0.907,  0.381],
       [ 0.004,  0.136,  0.819,  0.647,  0.892],
       [ 0.062,  0.389,  0.183,  0.289,  0.809],
       [ 0.539,  0.213,  0.805,  0.61 ,  0.677],
       [ 0.269,  0.071,  0.377,  0.25 ,  0.692],
       [ 0.274,  0.206,  0.655,  0.062,  0.229],
       [ 0.397,  0.115,  0.083,  0.19 ,  0.701]])
>>> LA.matrix_rank(m)
5

matrix condition:

>>> a1 = NP.random.randint(1, 10, 12).reshape(4, 3)
>>> LA.cond(a1)
5.7093446189400954

inversion requires a NumPy matrix though:

>>> a1 = NP.matrix(a1)
>>> type(a1)
<class 'numpy.matrixlib.defmatrix.matrix'>

>>> a1.I
matrix([[ 0.028,  0.028,  0.028,  0.028],
        [ 0.028,  0.028,  0.028,  0.028],
        [ 0.028,  0.028,  0.028,  0.028]])
>>> a1 = NP.array(a1)
>>> a1.I

Traceback (most recent call last):
   File "<pyshell#230>", line 1, in <module>
   a1.I
   AttributeError: 'numpy.ndarray' object has no attribute 'I'

but the Moore-Penrose pseudoinverse seems to works just fine

>>> LA.pinv(m)
matrix([[ 0.314,  0.407, -1.008, -0.553,  0.131,  0.373,  0.217,  0.785],
        [ 1.393,  0.084, -0.605,  1.777, -0.054, -1.658,  0.069, -1.203],
        [-0.042, -0.355,  0.494, -0.729,  0.292,  0.252,  1.079, -0.432],
        [-0.18 ,  1.068,  0.396,  0.895, -0.003, -0.896, -1.115, -0.666],
        [-0.224, -0.479,  0.303, -0.079, -0.066,  0.872, -0.175,  0.901]])

>>> m = NP.array(m)

>>> LA.pinv(m)
array([[ 0.314,  0.407, -1.008, -0.553,  0.131,  0.373,  0.217,  0.785],
       [ 1.393,  0.084, -0.605,  1.777, -0.054, -1.658,  0.069, -1.203],
       [-0.042, -0.355,  0.494, -0.729,  0.292,  0.252,  1.079, -0.432],
       [-0.18 ,  1.068,  0.396,  0.895, -0.003, -0.896, -1.115, -0.666],
       [-0.224, -0.479,  0.303, -0.079, -0.066,  0.872, -0.175,  0.901]])

Solution 4

There is a situation where the dot operator will give different answers when dealing with arrays as with dealing with matrices. For example, suppose the following:

>>> a=numpy.array([1, 2, 3])
>>> b=numpy.array([1, 2, 3])

Lets convert them into matrices:

>>> am=numpy.mat(a)
>>> bm=numpy.mat(b)

Now, we can see a different output for the two cases:

>>> print numpy.dot(a.T, b)
14
>>> print am.T*bm
[[1.  2.  3.]
 [2.  4.  6.]
 [3.  6.  9.]]

Solution 5

Reference from http://docs.scipy.org/doc/scipy/reference/tutorial/linalg.html

..., the use of the numpy.matrix class is discouraged, since it adds nothing that cannot be accomplished with 2D numpy.ndarray objects, and may lead to a confusion of which class is being used. For example,

>>> import numpy as np
>>> from scipy import linalg
>>> A = np.array([[1,2],[3,4]])
>>> A
    array([[1, 2],
           [3, 4]])
>>> linalg.inv(A)
array([[-2. ,  1. ],
      [ 1.5, -0.5]])
>>> b = np.array([[5,6]]) #2D array
>>> b
array([[5, 6]])
>>> b.T
array([[5],
      [6]])
>>> A*b #not matrix multiplication!
array([[ 5, 12],
      [15, 24]])
>>> A.dot(b.T) #matrix multiplication
array([[17],
      [39]])
>>> b = np.array([5,6]) #1D array
>>> b
array([5, 6])
>>> b.T  #not matrix transpose!
array([5, 6])
>>> A.dot(b)  #does not matter for multiplication
array([17, 39])

scipy.linalg operations can be applied equally to numpy.matrix or to 2D numpy.ndarray objects.

View more solutions

245,749

Author by

elexhobby

Updated on July 08, 2022

Comments

elexhobby almost 2 years

The numpy docs recommend using array instead of matrix for working with matrices. However, unlike octave (which I was using till recently), * doesn't perform matrix multiplication, you need to use the function matrixmultipy(). I feel this makes the code very unreadable.

Does anybody share my views, and has found a solution?
- wheaties over 13 years
  
  You're asking for opinions and not a question. Is there something more specific we could help you with or perhaps guide you in making it more readable?
- Matti Pastell over 13 years
  
  Actually the docs recommend using matrix if you do linear algebra and don't wan't to use multiply() so whats the problem?
- elexhobby over 13 years
  
  I haven't gone through the docs in detail. Just curious, what advantages do arrays offer over the matrix class? I found that arrays do not differentiate between rows and columns. Is it because arrays are supposed to be thought of as tensors rather than matrices? As Joe pointed out, the fact that matrix class is 2-dim is quite limiting. What's the thinking behind this kind of design, as in, why not have a single matrix class like matlab/octave?
- Charlie Parker over 6 years
  
  I guess the main issue is that python doesn't have .* vs '*' syntax for element wise vs matrix multiplication. If it had that then it would all be simpler though I'm surprised they choose * to mean element-wise and not matrix multiplication.
elexhobby over 13 years

Its unreadable when you have a stack of multiplications, for instance x'A'*Ax.
Joe Kington over 13 years

@elexhobby - x.T.dot(A.T).dot(A).dot(x) isn't that unreadable, i.m.o. To each his own, though. If you're primarily doing matrix multiplication, then by all means, use numpy.matrix!
amcnabb about 11 years

By the way, why is matrix multiplication called "dot"? In what sense is it a dot product?
Joe Kington about 11 years

@amcnabb - Matrix multiplication is sometimes referred to as a "dot product" in textbooks (in those books, the dot product you're thinking of is called a "scalar product" or "scalar dot product"). The scalar dot product is just matrix multiplication of two vectors, after all, so using "dot" to mean matrix multiplication in general isn't much of a stretch. That particular notation seems (?) more common in engineering and science texts than in mathematics, at least in my experience. Its prevalence in numpy is mostly because numpy.matrixmultiply is tough to type.
Joe Kington about 11 years

@amcnabb - Also, matrixmultiply was depreciated many years ago, and has been removed in any recent-ish (>v1.3, maybe?) version of numpy.
amcnabb about 11 years

@JoeKington - Is the scalar dot product really just matrix multiplication of two vectors? If the * operator is matrix multiplication, then u.T * v would be the scalar dot product, but u * v would be undefined (since the dimensions don't match). If matrix multiplication is defined as a sum over the last axis of the left array with the second-to-last axis of the right array, then I would assume that matrix multiplication of 1-D numpy arrays would fail because the right array does not have a second-to-last axis. Anyway, that's why I'm confused.
Joe Kington about 11 years

@amcnabb - numpy.dot is special-cased to behave as inner for 1d vectors. The inner product of two 1D vectors is strictly identical to the scalar dot product. (Incidentally, u.T and u are identical if u is one-dimensional. "Row vectors" and "column vectors" are 2D, and aren't vectors, strictly speaking. In numpy, vectors are vectors, and don't have a 2nd dimension, so you have to reshape them into 2D to have a row vector or column vector.)
amcnabb about 11 years

@JoeKington - I appreciate that numpy.dot behaves as an inner product for one-dimensional vectors, but this special casing makes the function seem self-inconsistent. Maybe I would be less confused if it were more common for books to refer to "dot" for matrix multiplication as you describe. Anyway, I understand it enough to use it, but I hate the obligation I feel to litter code with comments explaining that numpy.dot performs matrix multiplication.
Henry Gomersall about 11 years

@amcnabb the point is that dot generalizes to arbitrary dimensionality without ambiguity. It is this that makes numpy.dot equivalent to matrix multiplication. If you really dislike the notation, use the matrix class.
elexhobby over 9 years

Thanks! Yay, glad to see that I'm not the only one who feels that the current notation is unreadable.
db1234 over 9 years

mInv = NP.linalg.inv(m) computes the inverse of an array
Neil G over 9 years

Hope you don't mind I updated your answer for Python 3.5
Scientist1642 over 8 years

For me dot also seems unnatural name, and it was confusing at first.
Minh Triet over 8 years

An important point to note here is * is element-wise multiplication, dot is the true matrix multiplication. Please see stackoverflow.com/a/18255635/1780570
Minh Triet over 8 years

To be specific, * is element-wise multiplication, dot is the true matrix multiplication. Please see stackoverflow.com/a/18255635/1780570
Minh Triet over 8 years

An important point to note here is * is element-wise multiplication, dot is the true matrix multiplication. Please see stackoverflow.com/a/18255635/1780570
patapouf_ai almost 8 years

That is because as a numpy array, a.T == a, the transpose doesnt do anything.
patapouf_ai almost 8 years

If you write at = np.array([[1],[2],[3]]), then numpy.dot(at,b) should give you same. The difference between matix and array is not in the dot but in the transpose.
patapouf_ai almost 8 years

Or actually, if you write a = numpy.array([[1,2,3]]) then a.T will really transpose and everything will work just like in matrices.
HelloGoodbye almost 7 years

@amcnabb It's true that the matrix multiplication and the scalar product between two vectors are performed very similarly; however, the scalar product between two vectors commute, while the the matrix multiplication do not.
Charlie Parker over 6 years

I guess the main issue is that python doesn't have .* vs '*' syntax for element wise vs matrix multiplication. If it had that then it would all be simpler though I'm surprised they choose * to mean element-wise and not matrix multiplication.
schrödinbug almost 5 years

using dot for matrix multiplication is TERRIBLE notation since the vector dot product and matrix multiple are very different operations. For one instance, the dot product of two vectors is commutative, i.e. a.dot(b) == b.dot(a) (or at least should be per the definition of the dot product). In matrix multiplication np.matmul(a,b) != np.matmul(b,a)
HopeKing almost 4 years

IMP note: numpy matrices are to be avoided in favor of arrays. Note from documentation --> "It is no longer recommended to use this class, even for linear algebra. Instead use regular arrays. The class may be removed in the future." See also stackoverflow.com/a/61156350/6043669
Joe Huang about 2 years

[[1,1], [1,2]] @ [[75,0], [0,64]], unsupported operand type(s) for @: 'list' and 'list', python 3.9.