Linear Algebra and Its Applications

About this ebook

Praise for the First Edition

". . .recommended for the teacher and researcher as well as for graduate students. In fact, [it] has a place on every mathematician's bookshelf." -American Mathematical Monthly

Linear Algebra and Its Applications, Second Edition presents linear algebra as the theory and practice of linear spaces and linear maps with a unique focus on the analytical aspects as well as the numerous applications of the subject. In addition to thorough coverage of linear equations, matrices, vector spaces, game theory, and numerical analysis, the Second Edition features student-friendly additions that enhance the book's accessibility, including expanded topical coverage in the early chapters, additional exercises, and solutions to selected problems.

Beginning chapters are devoted to the abstract structure of finite dimensional vector spaces, and subsequent chapters address convexity and the duality theorem as well as describe the basics of normed linear spaces and linear maps between normed spaces.

Further updates and revisions have been included to reflect the most up-to-date coverage of the topic, including:
  • The QR algorithm for finding the eigenvalues of a self-adjoint matrix
  • The Householder algorithm for turning self-adjoint matrices into tridiagonal form
  • The compactness of the unit ball as a criterion of finite dimensionality of a normed linear space
Additionally, eight new appendices have been added and cover topics such as: the Fast Fourier Transform; the spectral radius theorem; the Lorentz group; the compactness criterion for finite dimensionality; the characterization of commutators; proof of Liapunov's stability criterion; the construction of the Jordan Canonical form of matrices; and Carl Pearcy's elegant proof of Halmos' conjecture about the numerical range of matrices.

Clear, concise, and superbly organized, Linear Algebra and Its Applications, Second Edition serves as an excellent text for advanced undergraduate- and graduate-level courses in linear algebra. Its comprehensive treatment of the subject also makes it an ideal reference or self-study for industry professionals.
Language: English
Release date: May 20, 2013
ISBN: 9781118626924

    Linear Algebra and Its Applications - Peter D. Lax

    Preface

    The outlook of this second edition is the same as that of the original: to present linear algebra as the theory and practice of linear spaces and linear mappings. Where it aids understanding and calculations, I don’t hesitate to describe vectors as arrays of numbers and to describe mappings as matrices. Render unto Caesar the things which are Caesar’s.

    If you can reduce a mathematical problem to a problem in linear algebra, you can most likely solve it, provided that you know enough linear algebra. Therefore, a thorough grounding in linear algebra is highly desirable. A sound undergraduate education should offer a second course on the subject, at the senior level. I wrote this book as a suitable text for such a course. The changes made in this second edition are partly to make it more suitable as a text. Terse descriptions, especially in the early chapters, were expanded, more problems were added, and a list of solutions to selected problems has been provided.

    In addition, quite a bit of new material has been added, such as the compactness of the unit ball as a criterion of finite dimensionality of a normed linear space. A new chapter discusses the QR algorithm for finding the eigenvalues of a self-adjoint matrix. The Householder algorithm for turning such matrices into tridiagonal form is presented. I describe in some detail the beautiful observation of Deift, Nanda, and Tomei of the analogy between the convergence of the QR algorithm and Moser’s theorem on the asymptotic behavior of the Toda flow as time tends to infinity.
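
    To make these two algorithms concrete, here is a minimal numpy sketch (my illustration, not code or notation from the book): Householder reflections reduce a real symmetric matrix to tridiagonal form, and unshifted QR iteration then drives the tridiagonal matrix toward a diagonal one whose entries are the eigenvalues.

        import numpy as np

        def householder_tridiagonalize(A):
            """Reduce a real symmetric matrix to tridiagonal form by reflections."""
            T = np.array(A, dtype=float)
            n = T.shape[0]
            for k in range(n - 2):
                x = T[k + 1:, k]
                s = np.linalg.norm(x)
                if s == 0.0:
                    continue
                v = x.copy()
                v[0] += s if x[0] >= 0 else -s      # stable sign choice
                v /= np.linalg.norm(v)
                # Apply H = I - 2vv^T to the affected rows and columns.
                T[k + 1:, :] -= 2.0 * np.outer(v, v @ T[k + 1:, :])
                T[:, k + 1:] -= 2.0 * np.outer(T[:, k + 1:] @ v, v)
            return T

        def qr_eigenvalues(A, iterations=500):
            """Unshifted QR iteration; each step T -> RQ is a similarity."""
            T = householder_tridiagonalize(A)
            for _ in range(iterations):
                Q, R = np.linalg.qr(T)
                T = R @ Q
            return np.sort(np.diag(T))

        A = np.array([[4.0, 1.0, 2.0],
                      [1.0, 3.0, 0.5],
                      [2.0, 0.5, 5.0]])
        print(qr_eigenvalues(A))          # matches np.linalg.eigvalsh(A)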

    Eight new appendices have been added to the first edition’s original eight, including the Fast Fourier Transform, the spectral radius theorem, proved with the help of the Schur factorization of matrices, and an excursion into the theory of matrix-valued analytic functions. Appendix 11 describes the Lorentz group, 12 is an interesting application of the compactness criterion for finite dimensionality, 13 is a characterization of commutators, 14 presents a proof of Liapunov’s stability criterion, 15 presents the construction of the Jordan Canonical form of matrices, and 16 describes Carl Pearcy’s elegant proof of Halmos’ conjecture about the numerical range of matrices.

    I conclude with a plea to include the simplest aspects of linear algebra in high-school teaching: vectors with two and three components, the scalar product, the cross product, the description of rotations by matrices, and applications to geometry. Such modernization of the high-school curriculum is long overdue.

    I acknowledge with pleasure much help I have received from Ray Michalek, as well as useful conversations with Albert Novikoff and Charlie Peskin. I also would like to thank Roger Horn, Beresford Parlett, and Jerry Kazdan for very useful comments, and Jeffrey Ryan for help in proofreading.

    PETER D. LAX

    New York, New York

    Preface to the First Edition

    This book is based on a lecture course designed for entering graduate students and given over a number of years at the Courant Institute of New York University. The course is open also to qualified undergraduates and on occasion was attended by talented high school students, among them Alan Edelman; I am proud to have been the first to teach him linear algebra. But, apart from special cases, the book, like the course, is for an audience that has some—not much—familiarity with linear algebra.

    Fifty years ago, linear algebra was on its way out as a subject for research. Yet during the past five decades there has been an unprecedented outburst of new ideas about how to solve linear equations, carry out least squares procedures, tackle systems of linear inequalities, and find eigenvalues of matrices. This outburst came in response to the opportunity created by the availability of ever faster computers with ever larger memories. Thus, linear algebra was thrust center stage in numerical mathematics. This had a profound effect, partly good, partly bad, on how the subject is taught today.

    The presentation of new numerical methods brought fresh and exciting material, as well as realistic new applications, to the classroom. Many students, after all, are in a linear algebra class only for the applications. On the other hand, bringing applications and algorithms to the foreground has obscured the structure of linear algebra—a trend I deplore; it does students a great disservice to exclude them from the paradise created by Emmy Noether and Emil Artin. One of the aims of this book is to redress this imbalance.

    My second aim in writing this book is to present a rich selection of analytical results and some of their applications: matrix inequalities, estimates for eigenvalues and determinants, and so on. This beautiful aspect of linear algebra, so useful for working analysts and physicists, is often neglected in texts.

    I strove to choose proofs that are revealing, elegant, and short. When there are two different ways of viewing a problem, I like to present both.

    The Contents describes what is in the book. Here I would like to explain my choice of materials and their treatment. The first four chapters describe the abstract theory of linear spaces and linear transformations. In the proofs I avoid elimination of the unknowns one by one, but use the linear structure; I particularly exploit quotient spaces as a counting device. This dry material is enlivened by some nontrivial applications to quadrature, to interpolation by polynomials, and to solving the Dirichlet problem for the discretized Laplace equation.

    In Chapter 5, determinants are motivated geometrically as signed volumes of ordered simplices. The basic algebraic properties of determinants follow immediately.

    Chapter 6 is devoted to the spectral theory of arbitrary square matrices with complex entries. The completeness of eigenvectors and generalized eigenvectors is proved without the characteristic equation, relying only on the divisibility theory of the algebra of polynomials. In the same spirit we show that two matrices A and B are similar if and only if (A − kI)^m and (B − kI)^m have nullspaces of the same dimension for all complex k and all positive integers m. The proof of this proposition leads to the Jordan canonical form.

    Euclidean structure appears for the first time in Chapter 7. It is used in Chapter 8 to derive the spectral theory of selfadjoint matrices. We present two proofs, one based on the spectral theory of general matrices, the other using the variational characterization of eigenvectors and eigenvalues. Fischer’s minmax theorem is explained.

    Chapter 9 deals with the calculus of vector- and matrix-valued functions of a single variable, an important topic not usually discussed in the undergraduate curriculum. The most important result is the continuous and differentiable character of eigenvalues and normalized eigenvectors of differentiable matrix functions, provided that appropriate nondegeneracy conditions are satisfied. The fascinating phenomenon of "avoided crossings" is briefly described and explained.

    The first nine chapters, or certainly the first eight, constitute the core of linear algebra. The next eight chapters deal with special topics, to be taken up depending on the interest of the instructor and of the students. We shall comment on them very briefly.

    Chapter 10 is a symphony of inequalities about matrices, their eigenvalues, and their determinants. Many of the proofs make use of calculus.

    I included Chapter 11 to make up for the unfortunate disappearance of mechanics from the curriculum and to show how matrices give an elegant description of motion in space. Angular velocity of a rigid body and divergence and curl of a vector field all appear naturally. The monotonic dependence of eigenvalues of symmetric matrices is used to show that the natural frequencies of a vibrating system increase if the system is stiffened and the masses are decreased.

    In Chapter 12 we present the descriptions of convex sets in terms of gauge functions and support functions. The workhorse of the subject, the hyperplane separation theorem, is proved by means of the Hahn–Banach procedure. Carathéodory’s theorem on extreme points is proved and used to derive the König–Birkhoff theorem on doubly stochastic matrices; Helly’s theorem on the intersection of convex sets is stated and proved.

    Chapter 13 is on linear inequalities; the Farkas–Minkowski theorem is derived and used to prove the duality theorem, which then is applied in the usual fashion to a maximum–minimum problem in economics, and to the minmax theorem of von Neumann about two-person zero-sum games.

    Chapter 14 is on normed linear spaces; it is mostly standard fare except for a dual characterization of the distance of a point from a linear subspace. Linear mappings of normed linear spaces are discussed in Chapter 15.

    Chapter 16 presents Perron’s beautiful theorem on matrices all of whose entries are positive. The standard application to the asymptotics of Markov chains is described. In conclusion, the theorem of Frobenius about the eigenvalues of matrices with nonnegative entries is stated and proved.

    The last chapter discusses various strategies for solving iteratively systems of linear equations of the form Ax = b, A a self-adjoint, positive matrix. A variational formula is derived and a steepest descent method is analyzed. We go on to present several versions of iterations employing Chebyshev polynomials. Finally we describe the conjugate gradient method in terms of orthogonal polynomials.
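
    As a small illustration of the last of these methods, here is a minimal sketch of the conjugate gradient iteration in numpy (my sketch, assuming A symmetric positive definite; the book's derivation via orthogonal polynomials is not reproduced here):

        import numpy as np

        def conjugate_gradient(A, b, tol=1e-10):
            x = np.zeros_like(b)
            r = b - A @ x                  # residual of the current iterate
            p = r.copy()                   # first search direction
            rs = r @ r
            for _ in range(len(b)):
                Ap = A @ p
                alpha = rs / (p @ Ap)      # exact line search along p
                x += alpha * p
                r -= alpha * Ap
                rs_new = r @ r
                if np.sqrt(rs_new) < tol:
                    break
                p = r + (rs_new / rs) * p  # next direction, A-conjugate to p
                rs = rs_new
            return x

        A = np.array([[4.0, 1.0], [1.0, 3.0]])
        b = np.array([1.0, 2.0])
        print(conjugate_gradient(A, b))    # agrees with np.linalg.solve(A, b)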

    It is with genuine regret that I omit a chapter on the numerical calculation of eigenvalues of self-adjoint matrices. Astonishing connections have been discovered recently between this important subject and other seemingly unrelated topics.

    Eight appendices describe material that does not quite fit into the flow of the text, but that is so striking or so important that it is worth bringing to the attention of students. The topics I have chosen are special determinants that can be evaluated explicitly, Pfaff’s theorem, symplectic matrices, tensor product, lattices, Strassen’s algorithm for fast matrix multiplication, Gershgorin’s theorem, and the multiplicity of eigenvalues. There are other equally attractive topics that could have been chosen: the Baker–Campbell–Hausdorff formula, the Kreiss matrix theorem, numerical range, and the inversion of tridiagonal matrices.

    Exercises are sprinkled throughout the text; a few of them are routine; most require some thinking and a few of them require some computing.

    My notation is neoclassical. I prefer to use four-letter Anglo-Saxon words like "into," "onto," and "1-to-1," rather than polysyllabic ones of Norman origin. The end of a proof is marked by an open square.

    The bibliography consists of the usual suspects and some recent texts; in addition, I have included Courant–Hilbert, Volume I, unchanged from the original German version in 1924. Several generations of mathematicians and physicists, including the author, first learned linear algebra from Chapter 1 of this source.

    I am grateful to my colleagues at the Courant Institute and to Myron Allen at the University of Wyoming for reading and commenting on the manuscript and for trying out parts of it on their classes. I am grateful to Connie Engle and Janice Want for their expert typing.

    I have learned a great deal from Richard Bellman’s outstanding book, Introduction to Matrix Analysis; its influence on the present volume is considerable. For this reason and to mark a friendship that began in 1945 and lasted until his death in 1984, I dedicate this book to his memory.

    PETER D. LAX

    New York, New York

    CHAPTER 1

    Fundamentals

    This first chapter aims to introduce the notion of an abstract linear space to those who think of vectors as arrays of components. I want to point out that the class of abstract linear spaces is no larger than the class of spaces whose elements are arrays. So what is gained by this abstraction?

    First of all, the freedom to use a single symbol for an array; this way we can think of vectors as basic building blocks, unencumbered by components. The abstract view leads to simple, transparent proofs of results.

    More to the point, the elements of many interesting vector spaces are not presented in terms of components. For instance, take a linear ordinary differential equation of order n; the set of its solutions forms a vector space of dimension n, yet its elements are not presented as arrays.

    Even if the elements of a vector space are presented as arrays of numbers, the elements of a subspace of it may not have a natural description as arrays. Take, for instance, the subspace of all vectors whose components add up to zero.

    Last but not least, the abstract view of vector spaces is indispensable for infinite-dimensional spaces; even though this text deals strictly with finite-dimensional spaces, it is a good preparation for functional analysis.

    Linear algebra abstracts the two basic operations with vectors: the addition of vectors, and their multiplication by numbers (scalars). It is astonishing that on such slender foundations an elaborate structure can be built, with romanesque, gothic, and baroque aspects. It is even more astounding that linear algebra has not only the right theorems but also the right language for many mathematical topics, including applications of mathematics.

    A linear space X over a field K is a mathematical object in which two operations are defined:

    Addition, denoted by +, as in

    (1) x + y,

    and assumed to be commutative:

    (2) x + y = y + x,

    and associative:

    (3) x + (y + z) = (x + y) + z,

    and to form a group, with the neutral element denoted as 0:

    (4) x + 0 = x.

    The inverse of addition is denoted by –:

    (5) x + (−x) = 0.

    EXERCISE 1. Show that the zero of vector addition is unique.

    The second operation is multiplication of elements x of X by elements k of the field K, the result denoted as kx.

    The result of this multiplication is a vector, that is, an element of X.

    Multiplication by elements of K is assumed to be associative:

    (6) a(bx) = (ab)x,

    and distributive:

    (7) k(x + y) = kx + ky,

    as well as

    (8) (a + b)x = ax + bx.

    We assume that multiplication by the unit of K, denoted as 1, acts as the identity:

    (9) 1x = x.

    These are the axioms of linear algebra. We proceed to draw some deductions:

    Set b = 0 in (8); it follows from Exercise 1 that for all x

    (10) 0x = 0.

    Set a = 1, b = −1 in (8); using (9) and (10) we deduce that for all x

    (−1)x = −x.

    EXERCISE 2. Show that the vector with all components zero serves as the zero element of classical vector addition.

    In this analytically oriented text the field K will be either the field of real numbers or the field of complex numbers.

    An interesting example of a linear space is the set of all functions x(t) that satisfy the differential equation

    x″ = −x.

    The sum of two solutions is again a solution, and so is a constant multiple of one. This shows that the set of solutions of this differential equation forms a linear space.

    Solutions of this equation describe the motion of a mass connected to a fixed point by a spring. Once the initial position x(0) = p and the initial velocity x′(0) = v are given, the motion is completely determined for all t. So solutions can be described by a pair of numbers (p, v).

    The relation between the two descriptions is linear; that is, if (p, v) are the initial data of a solution x(t), and (q, w) the initial data of another solution y(t), then the initial data of the solution x(t) + y(t) are (p + q, v + w) = (p, v) + (q, w). Similarly, the initial data of the solution kx(t) are (kp, kv) = k(p, v).
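
    A quick numerical check of this linearity (my illustration; it uses the fact that x(t) = p cos t + v sin t is the solution with initial data (p, v)):

        import numpy as np

        def solution(p, v):
            # The solution of x'' = -x with x(0) = p, x'(0) = v.
            return lambda t: p * np.cos(t) + v * np.sin(t)

        t = np.linspace(0.0, 10.0, 7)
        x, y = solution(1.0, 2.0), solution(3.0, -1.0)
        # Sums map to sums: (1, 2) + (3, -1) = (4, 1).
        print(np.allclose(x(t) + y(t), solution(4.0, 1.0)(t)))    # True
        # Scalar multiples map to scalar multiples: 3(1, 2) = (3, 6).
        print(np.allclose(3.0 * x(t), solution(3.0, 6.0)(t)))     # True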

    This kind of relation has been abstracted into the notion of isomorphism.

    Definition. A one-to-one correspondence between two linear spaces over the same field that maps sums into sums and scalar multiples into scalar multiples is called an isomorphism.

    Isomorphism is a basic notion in linear algebra. Isomorphic linear spaces are indistinguishable by means of operations available in linear spaces. Two linear spaces that are presented in very different ways can be, as we have seen, isomorphic.

    Examples of Linear Spaces. (i) Set of all row vectors (a1, …, an), aj in K; addition and multiplication defined componentwise. This space is denoted as Kⁿ.

    (ii) Set of all real-valued functions f(x) defined on the real line, K = ℝ.

    (iii) Set of all functions with values in K, defined on an arbitrary set S.

    (iv) Set of all polynomials of degree less than n with coefficients in K.

    EXERCISE 3. Show that (i) and (iv) are isomorphic.

    EXERCISE 4. Show that if S has n elements, (i) and (iii) are isomorphic.

    EXERCISE 5. Show that when K = ℝ, (iv) is isomorphic with (iii) when S consists of n distinct points of ℝ.
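
    A computational sketch of the isomorphism behind Exercises 3 and 5 (my illustration, with K = ℝ): a coefficient array in ℝⁿ corresponds to the polynomial with those coefficients, and evaluating the polynomial at n distinct points identifies it with a function on those points.

        import numpy as np

        n = 4
        coeffs = np.array([2.0, 0.0, -1.0, 3.0])   # an element of R^4
        points = np.array([0.0, 1.0, 2.0, 3.0])    # n distinct points of R

        # (iv) -> (iii): evaluate a1 + a2*t + ... + an*t**(n-1) on the set S.
        V = np.vander(points, n, increasing=True)
        values = V @ coeffs

        # The Vandermonde matrix of distinct points is invertible, so the
        # correspondence can be reversed; that is what makes it one-to-one.
        print(np.allclose(np.linalg.solve(V, values), coeffs))    # True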

    Definition. A subset Y of a linear space X is called a subspace if sums and scalar multiples of elements of Y belong to Y.

    Examples of Subspaces. (a) X as in Example (i), Y the set of vectors (0, a2, …, an−1, 0) whose first and last components are zero.

    (b) X as in Example (ii), Y the set of all periodic functions with period π.

    (c) X as in Example (iii), Y the set of constant functions on S.

    (d) X as in Example (iv), Y the set of all even polynomials.

    Definition. The sum of two subsets Y and Z of a linear space X, denoted as Y + Z, is the set of all vectors of form y + z, y in Y, z in Z.

    EXERCISE 6. Prove that Y + Z is a linear subspace of X if Y and Z are.

    Definition. The intersection of two subsets Y and Z of a linear space X, denoted as Y ∩ Z, consists of all vectors x that belong to both Y and Z.

    EXERCISE 7. Prove that if Y and Z are linear subspaces of X, so is Y ∩ Z.

    EXERCISE 8. Show that the set {0} consisting of the zero element of a linear space X is a subspace of X. It is called the trivial subspace.

    Definition. A linear combination of j vectors x1, …, xj of a linear space is a vector of the form

    k1x1 + ⋯ + kjxj,    ki in K.

    EXERCISE 9. Show that the set of all linear combinations of x1, …, xj is a subspace of X, and that it is the smallest subspace of X containing x1, …, xj. This is called the subspace spanned by x1, …, xj.

    Definition. A set of vectors x1, …, xm in X spans the whole space X if every x in X can be expressed as a linear combination of x1, …, xm.

    Definition. The vectors x1, …, xj are called linearly dependent if there is a nontrivial linear relation between them, that is, a relation of the form

    k1x1 + ⋯ + kjxj = 0,

    where not all k1, …, kj are zero.

    Definition. A set of vectors x1, …, xj that are not linearly dependent is called linearly independent.

    EXERCISE 10. Show that if the vectors x1, …, xj are linearly independent, then none of the xi is the zero vector.

    Lemma 1. Suppose that the vectors x1, …, xn span a linear space X and that the vectors y1, …, yj in X are linearly independent. Then

    j ≤ n.

    Proof. Since x1, …, xn span X, every vector in X can be written as a linear combination of x1, …, xn. In particular, y1:

    y1 = k1x1 + ⋯ + knxn.

    Since y1 ≠ 0 (see Exercise 10), not all k are equal to 0, say ki ≠ 0. Then xi can be expressed as a linear combination of y1 and the remaining x’s. So the set consisting of the x’s, with xi replaced by y1, spans X. If j > n, we could repeat this step n − 1 more times and conclude that y1, …, yn span X; but then yn+1 would be a linear combination of y1, …, yn, contradicting the linear independence of the y’s. Hence j ≤ n. □

    Definition. A finite set of vectors which span X and are linearly independent is called a basis for X.

    Lemma 2. A linear space X which is spanned by a finite set of vectors x1, …, xn has a basis.

    Proof. If x1, …, xn are linearly dependent, there is a nontrivial relation between them; from this, one of the xi can be expressed as a linear combination of the rest. So we can drop that xi. Repeat this step until the remaining x’s are linearly independent; they still span X, and so they form a basis. □
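
    The proof of Lemma 2 is really an algorithm; here is one way to carry it out over ℝ (my sketch, using the rank of a matrix to test linear dependence):

        import numpy as np

        def extract_basis(vectors, tol=1e-10):
            basis = []
            for v in vectors:
                # Keep v only if it is not a linear combination of the
                # vectors kept so far, i.e., if it increases the rank.
                if np.linalg.matrix_rank(np.array(basis + [v]), tol=tol) > len(basis):
                    basis.append(v)
            return basis

        spanning = [np.array([1.0, 0.0, 1.0]),
                    np.array([2.0, 0.0, 2.0]),     # twice the first: dropped
                    np.array([0.0, 1.0, 0.0]),
                    np.array([1.0, 1.0, 1.0])]     # first + third: dropped
        print(len(extract_basis(spanning)))        # 2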

    Definition. A linear space X is called finite dimensional if it has a basis.

    A finite-dimensional space has many, many bases. When the elements of the space are represented as arrays with n components, we give preference to the special basis consisting of the vectors that have one component equal to 1, while all the others equal 0.

    Theorem 3. All bases for a finite-dimensional linear space X contain the same number of vectors. This number is called the dimension of X and is denoted as

    dim X.

    Proof. Let x1, …, xn be one basis, and let y1, …, ym be another. By Lemma 1 and the definition of basis we conclude that m ≤ n, and also n ≤ m. So n and m are equal. □

    We define the dimension of the trivial space consisting of the single element 0 to be zero.

    Theorem 4. Every linearly independent set of vectors y1, …, yj in a finite-dimensional linear space X can be completed to a basis of X.

    Proof. If y1, …, yj do not span X, there is some x1 that cannot be expressed as a linear combination of y1, …, yj. Adjoin this x1 to the y’s. Repeat this step until the y’s span X. This will happen in less than n steps, n = dim X, because otherwise X would contain more than n linearly independent vectors, impossible for a space of dimension n.
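
    The proof of Theorem 4 can be carried out the same way (again my sketch, with the same rank test as before): adjoin a candidate vector whenever it lies outside the span of what has been collected. Here the candidates are the standard basis vectors of ℝ³.

        import numpy as np

        def complete_to_basis(independent, n, tol=1e-10):
            basis = [np.asarray(v, dtype=float) for v in independent]
            for i in range(n):
                e = np.zeros(n)
                e[i] = 1.0                          # candidate vector
                if np.linalg.matrix_rank(np.array(basis + [e]), tol=tol) > len(basis):
                    basis.append(e)                 # e is outside the span
            return basis

        for v in complete_to_basis([np.array([1.0, 1.0, 0.0])], 3):
            print(v)    # three vectors: a basis of R^3 containing (1, 1, 0)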

    Theorem 4 illustrates the many different ways of forming a basis for a linear space.

    Theorem 5. (a) Every subspace Y of a finite-dimensional linear space X is finite dimensional.

    (b) Every subspace Y has a complement in X, that is, another subspace Z such that every vector x in X can be decomposed uniquely as

    (11) x = y + z,    y in Y, z in Z.

    Furthermore

    (11)′ dim X = dim Y + dim Z.

    Proof. We can construct a basis in Y by starting with any nonzero vector y1, and then adding another vector y2 and another, as long as they are linearly independent. According to Lemma 1, there can be no more of these yi than the dimension of X. A maximal set of linearly independent vectors y1, …, yj in Y spans Y, and so forms a basis of Y. According to Theorem 4, this set can be completed to form a basis of X by adjoining zj+1, …, zn. Define Z as the space spanned by zj+1, …, zn; clearly Y and Z are complements, and dim Y + dim Z = j + (n − j) = n = dim X. □

    Definition. X is said to be the direct sum of two subspaces Y and Z that are complements of each other. More generally X is said to be the direct sum of its subspaces Y1, …, Ym if every x in X can be expressed uniquely as

    (12) x = y1 + ⋯ + ym,    yi in Yi.

    This relation is denoted as

    X = Y1 ⊕ ⋯ ⊕ Ym.

    EXERCISE 11. Prove that if X is finite dimensional and the direct sum of Y1, …, Ym, then

    (12)′ dim X = dim Y1 + ⋯ + dim Ym.

    Definition. An (n – 1)-dimensional subspace of an n-dimensional space is called a hyperplane.

    EXERCISE 12. Show that every finite-dimensional space X over K is isomorphic to Kⁿ, n = dim X. Show that this isomorphism is not unique when n > 1.

    Since every n-dimensional linear space over K is isomorphic to Kⁿ, it follows that two linear spaces over the same field and of the same dimension are isomorphic.

    Note: There are many ways of forming such an isomorphism; it is not unique.

    The concept of congruence modulo a subspace, defined below, is a very useful tool.

    Definition. For X a linear space, Y a subspace, we say that two vectors x1, x2 in X are congruent modulo Y, denoted

    x1 ≡ x2 mod Y,

    if x1 − x2 ∈ Y. Congruence mod Y is an equivalence relation, that is, it is

    (i) symmetric: if x1 ≡ x2, then x2 ≡ x1.

    (ii) reflexive: x ≡ x for all x in X.

    (iii) transitive: if x1 ≡ x2 and x2 ≡ x3, then x1 ≡ x3.

    EXERCISE 13. Prove (i)–(iii) above. Show furthermore that if x1 ≡ x2, then kx1 ≡ kx2 for every scalar k.

    We can divide the elements of X into congruence classes mod Y. The congruence class containing the vector x is the set of all vectors congruent with x; we denote it by {x}.

    EXERCISE 14. Show that two congruence classes are either identical or disjoint.

    The set of congruence classes can be made into a linear space by defining addition and multiplication by scalars as follows:

    {x} + {z} = {x + z}

    and

    k{x} = {kx}.

    That is, the sum of the congruence class containing x and the congruence class containing z is the class containing x + z. Similarly for multiplication by scalars.

    EXERCISE 15. Show that the above definition of addition and multiplication by scalars is independent of the choice of representatives in the congruence class.

    The linear space of congruence classes defined above is called the quotient space of X mod Y and is denoted as

    X/Y.

    The following example is illuminating: Take X to be the linear space of all row vectors (a1, …, an) with n components, and take Y to be all vectors y = (0, 0, a3, …, an) whose first two components are zero. Then two vectors are congruent mod Y iff their first two components are equal. Each equivalence class can be represented by a vector with two components, the common components of all vectors in the equivalence class.

    This shows that forming a quotient space amounts to throwing away information contained in those components that pertain to Y. This is a very useful simplification when we do not need the information contained in the neglected components.
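
    In code the example reads as follows (my sketch, with X = ℝ⁴ and Y the vectors whose first two components are zero):

        import numpy as np

        def cls(x):
            # The class {x} mod Y is determined by the first two components;
            # the components along Y carry the discarded information.
            return (float(x[0]), float(x[1]))

        x1 = np.array([1.0, 2.0, 9.0, -3.0])
        x2 = np.array([1.0, 2.0, 0.0, 7.0])     # x1 - x2 lies in Y
        print(cls(x1) == cls(x2))                # True: x1 ≡ x2 mod Y

        # Addition of classes does not depend on the representatives:
        z = np.array([3.0, 0.0, 1.0, 1.0])
        print(cls(x1 + z) == cls(x2 + z))        # True: both are (4.0, 2.0)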

    The next result shows the usefulness of quotient spaces for counting the dimension of a subspace.

    Theorem 6. Let Y be a subspace of a finite-dimensional linear space X; then

    (13) dim Y + dim(X/Y) = dim X.

    Proof. Let y1, …, yj be a basis for Y, j = dim Y. According to Theorem 4, this set can be completed to form a basis for X by adjoining xj+1, …, xn, n = dim X. We claim that the congruence classes

    (13)′ {xj+1}, …, {xn}

    form a basis for X/Y. To show this we have to verify two properties of the cosets (13)′:

    (i) They span X/Y.

    (ii) They are linearly independent.

    (i) Since y1, …, yj, xj+1, …, xn form a basis for X, every x in X can be expressed as

    x = ∑ aiyi + ∑ ckxk.

    Since each yi belongs to Y, {yi} = {0}; it follows that

    {x} = ∑ ck{xk}.

    (ii) Suppose that

    ∑ ck{xk} = {0}.

    This means that

    ∑ ckxk = y,    y in Y.

    Express y as ∑ diyi; we get

    ∑ ckxk − ∑ diyi = 0.

    Since y1, …, yj, xj+1, …, xn form a basis, they are linearly independent, and so all the ck and di are zero.

    It follows that the cosets (13)′ form a basis for X/Y; therefore

    dim(X/Y) = n − j.

    Adding dim Y = j gives

    dim Y + dim(X/Y) = n = dim X. □

    EXERCISE 16. Denote by X the linear space of all polynomials p(t) of degree < n, and denote by Y the set of polynomials that are zero at t1, …, tj, j < n.

    (i) Show that Y is a subspace of X.

    (ii) Determine dim Y.

    (iii) Determine dim X/Y.

    The following corollary is a consequence of Theorem 6.

    Corollary 6′. A subspace Y of a finite-dimensional linear space X whose dimension is the same as the dimension of X is all of X.

    EXERCISE 17. Prove Corollary 6′.

    Theorem 7. Suppose X is a finite-dimensional linear space, U and V two subspaces of X such that X is the sum of U and V:

    X = U + V.

    Denote by W the intersection

    W = U ∩ V.
