2. PDF (letter size)
  3. PDF (legal size)

my mathematics cheat sheet

Nasser M. Abbasi

April 15, 2018 compiled on — Sunday April 15, 2018 at 04:48 PM


1 general notes
2 things to find out
3 Converting first order ODE which is homogeneous to separable ODE
4 Direct solving of some simple PDE’s
5 Fourier series flow chart
6 Linear combination of two solution is solution to ODE
7 To find the Wronskian ODE
8 Sturm-Liouville Notes
 8.1 Definitions
 8.2 Proof symmetry of operator
 8.3 Proof eigenvalues are real
 8.4 Proof eigenfunctions are unique
 8.5 Proof eigenfunctions are real
 8.6 Proof eigenfunctions are orthogonal with weight
 8.7 Special functions generated from solving singular Sturm-Liouville
 8.8 Some notes on S-L
 8.9 Methods of solutions to some problems
 8.10 Examples converting second order linear ODE to Sturm-Liouville
  8.10.1 Example 1
  8.10.2 Example 2
  8.10.3 Example 3
  8.10.4 Example 4
  8.10.5 Example 5
  8.10.6 Example 5
9 Green functions notes
10 Laplace transform notes
11 Some CAS notes
12 References

A place to keep quick notes about Math that I keep forgetting. This is meant to be a scratch notes to write math notes. Can contain errors and/or not complete. Use at your own risk.

1 general notes

  1. Methods to find Green function are

    1. Fredholm theory
    2. methods of images
    3. separation of variables
    4. Laplace transform

    reference wikipedia I need to make one example and apply each of the above methods on it.

  2. we can’t just say f z is Analytic and stop. Have to say f z is analytic at a point z0    or in a region. f z is analytic at a point z0    if the power series for f z expanded around z0    converges to f  z evaluated at z0    . (i.e. f z has Taylor series that converges to f z at the point).
    For a region, the above test will be true for every point in that region. For example, f z= √z is not analytic at z = 0
 0   . This is because f ′ z when evaluated at z = 0
 0   is a singularity (a fancy name for saying it blows up at z0 = 0   ). But f 0 = 0   , so we see that f z is not analytic at z0 = 0   since its power series does not converge to the same value as the function at z0 = 0
    But this is only for this point z0 = 0   . The function f z0 is analytic at another point, say z0 = 1   .
  3. In solving an ODE with constant coefficient just use the characteristic equation to solve the solution.
  4. In solving an ODE with coefficients that are functions that depends on the independent variable, as in y′′ x +q x y ′ x +p x y x= 0   , first classify the point x
 0    type. This means to check how p x and q x  behaves at x0    . We are talking about the ODE here, not the solution yet. There are 3 kinds of points. x0    can be normal, or regular singular point, or irregular singular point. Normal x0    means p x  and q x  have Taylor series expansion 
y x =  ∞n=0an x − x0 n    that converges to y x at x0    .
    Regular singular point x0    means that the above test fails, but limx →x0 x − x0q x has a convergent Taylor series, and also that limx→x0 x − x0 2p x  now has a convergent Taylor series at x0    . All this just means we can get rid of the singularity. i.e. x0    is a removable singularity. If this is the case, then the solution at x0    can be assumed to have a Frobenius series y x=    ∞  a  x − x n+¸
        n=0 n     0    where a  ⇔ 0
 0   and  must be integer values.
    The third type of point, is the hard one. Called irregular singular point. We can’t get rid of it using the above trick. So we also say the ODE has an essential singularity at x0    (another fancy name for irregular singular point, I think). What this means is that we can’t approximate the solution at x0    using either Taylor nor Frobenius series.
    If the point is an irregular singular point, then use the methods of asymptotic. See advanced mathematical methods for scientists and engineers chapter 3. For normal point, use           ∞     n
y x =   n=0 anx    , for regular singular point use           ∞     n+r
y x =  n=0anx    . Remember, to solve for r first. This should give two values. If you get one root, then use reduction of order to find second solution.
  5. For Taylor series, leading behavior is a0    no controlling factor? For Frobenius series, leading behavior term is a0x¸    and controlling factor is x ¸    . For asymptotic series, controlling factor is assumed to be eS x always. proposed by Carlini (1817)
  6. Method to find the leading behavior of the solution y x near irregular singular point using asymptotic is called the dominant balance method.
  7. When solving y ′′ + p xy′ + q xy = 0   for very small  then use WKB method, if there is no boundary layer between the boundary conditions. If the ODE non-linear, can’t use WKB, has to use boundary layer (B.L.).  Example ′′    ′
y  +yy  − y = 0   with y 0= 0,y 1= −2   then use BL.
  8. good exercise is to solve say y ′′ + 1+ x y ′ +y = 0   with y 0= y 1 using both B.L. and WKB and compare the solutions, they should come out the same.              (      )
y ∼ -2-− exp  −x− x2  + O   .
    1+x       ›   2›  with BL had to do the matching between the outer and the inner solutions. WKB is easier. But can’t use it for non-linear ODE.
  9. When there is rapid oscillation over the entire domain, WKB is better. Use WKB to solve Schrodinger equation where  becomes function of ℏ (Planck’s constant,               −34
6.62606957× 10    m2    kg/s)
  10. In second order ODE with non constant coefficient, y ′′ x+ p xy′ x+ q xy x= 0   , if we know one solution y1 x  , then a method called the reduction of order can be used to find the second solution y2 x . Write y2 x =u x y1 x  , plug this in the ODE, and solve for u  x . The final solution will be y x= c1y1 x+ c2y2 x . Now apply I.C.’s to find c ,c
 1  2    .
  11. To find particular solution to y ′′ x+ p xy′ x+ q xy x = f x , we can use a method called undetermined coefficients.  But a better method is called variation of parameters, In this method, assume yp x= u1 xy1 x+ u2 xy2 x where y1 x,y2 x  are the two linearly independent solutions of the homogeneous ODE and u1 x,u2 x are to be determined. This ends up with u  x= −    y2 x f-xdx
 1           W and u  x=    y1 x f-xdx
 2         W . Remember to put the ODE in standard form first, so a = 1   , i.e.   ′′
ay  x + ⋅⋅⋅ . In here, W is the Wronskian      ||           ||
W  = ||y1′x  y2′ x||
     y1 x  y2 x
  12. Two solutions of y′′ x +p xy ′ x +q xy x= 0   are linearly independent if W  x ⇔ 0   , where W is the Wronskian.
  13. For second order linear ODE defined over the whole real line, the Wronskian is either always zero, or not zero. This comes from Abel formula for Wronskian, which is              (    B-x  )
W   x= k exp −  A xdx  for ODE of form A  xy′′ +B xy′ + C xy = 0   . Since     (         )
exp  −   B-xdx  > 0
        A x   , then it is decided by k . The constant of integration. If k = 0   , then W  x = 0   everywhere, else it is not zero everywhere.
  14. For linear PDE, if boundary condition are time dependent, can not use separation of variables. Try Transform method (Laplace or Fourier) to solve the PDE.
  15. If unable to invert Laplace analytically, try numerical inversion or asymptotic methods. Need to find example of this.
  16. Cauchy-Goursat: If f z is analytic everywhere inside closed contour C then 
   f zdz = 0

C   .
  17. Cauchy residue: If f z is analytic everywhere inside closed contour C except at some points z1,z2,⋅⋅⋅,zN    then 
   f zdz = 2  i   N Res  f z
                 j=1         z=zj
C     . The term Res f z
        z=zj     is the residue of f z at point zj    . Use Laurent expansion of f z to find residues.
  18. Green function takes the homogeneous solution and the forcing function and constructs a particular solution. For PDE’s, we always want a symmetric Green’s function.
  19. To get a symmetric Green’s function given an ODE, start by converting the ODE to a Sturm-Liouville form first. This way the Green’s function comes out symmetric.
  20. For numerical solutions of field problems, there are basically two different problems: Those with closed boundaries and those with open boundaries but with initial conditions. Closed boundaries are elliptical problems which can be cast in the form Au = f , and the other are either hyperbolic or parabolic.
  21. For numerical solution of elliptical problems, the basic layout is something like this: Always start with trial solution u x such that 
utrial x=   i=i=N0 Ci  i x  where the Ci    are the unknowns to be determined and the i    are set of linearly independent functions (polynomials) in x . How to determine those Ci    comes next. Use either residual method (Galerkin) or variational methods (Ritz). For residual, we make a function based on the error R = A − utrialf . It all comes down to solving 
f R = 0   over the domain. This is a picture
    |                                                     |
    residual                       Variational (sub u_trial in I(u)
    |                          where I(u) is functional to minimize.
    |                |             |          |
    Absolute error   collocation   subdomain orthogonality
    |                      |            |
    method of moments   Galerkin     least squares
  22. Geometric probability distribution. Use when you want an answer to the question: What is the probability you have to do the experiment N times to finally get the output you are looking for, given that a probability of p showing up from doing one experiment.

    For example: What is the probability one has to flip a fair coin N times to get a head? The answer is P X  = N = 1− pk−1p . So for a fair coin, p = 12    that a head will show up from one flip. So the probability we have to flip a coin 10   times to get a head is P X = 10 = 1 − 0.59 0.5= 0.00097   which is very low as expected.

  23. To generate random variable drawn from some distribution different from uniform distribution, by only using uniform distribution U 0,1 do this: Lets say we want to generate random number from exponential distribution with mean  . This distribution has          -1 −—x
pd f X = —e     , the first step is to find the cdf of exponential distribution, which is known to be                        −x
F x = P X <= x= 1 − e —     . Now find the inverse of this, which is F −1 x = − ln 1− x . Then generate a random number from the uniform distribution U 0,1 . Let this value be called z . Now plug this value into F −1 z , this gives a random number from exponential distribution, which will be −   ln 1 − z (take the natural log of both side of F x ).

    This method can be used to generate random variables from any other distribution by knowing on U  0,1 . But it requires knowing the CDF and the inverse of the CDF for the other distribution. This is called the inverse CDF method. Another method is called the rejection method

  24. Given u , a r.v. from uniform distribution over [0,1], then to obtain v , a r.v. from uniform distribution over [A,B], then the relation is v = A + B − Au .
  25. When solving using F.E.M. is best to do everything using isoparametric element (natural coordinates), then find the Jacobian of transformation between the natural and physical coordinates to evaluate the integrals needed. For the force function, using Gaussian quadrature method.
  26. A solution to differential equation is a function that can be expressed as a convergent series. (Cauchy. Briot and Bouquet, Picard)
  27. To solve a first order ODE using integrating factor.
    x′ t +p t x t= f t

    then as long as it is linear and p t,f t are integrable functions in t , then follow these steps

    1. multiply the ODE by function I t , this is called the integrating factor.
I t x  t +I t p t x t= I t f t

    2. We solve for I t such that the left side satisfies
      d- I tx t= I t x′ t+ I t p t x t

    3. Solving the above for I t gives

      Integrating both sides gives

    4. Now eq(1) can be written as
dt I tx t= I t f t

      We now integrate the above to give


      Where I t is given by (2). Hence

      --e-p-t dtf-t dt +-C
x t=          p t dt

  28. a polynomial is called ill-conditions if we make small change to one of its coefficients and this causes large change to one of the roots.
  29. to find rank of matrix A by hand, find the row echelon form, then count how many zero rows there are. subtract that from number of rows, i.e. n .
  30. to find the basis of the column space of A , find the row echelon form and pick the columns with the pivots, there are the basis (the linearly independent columns of A ).
  31. For symmetric matrix A , its second norm is its spectral radius A which is the largest eigenvalue of A (in absolute terms).
  32. The eigenvalues of the inverse of matrix A is the inverse of the eigenvalues of A .
  33. If matrix A of order n × n , and it has n distinct eigenvalues, then it can be diagonalized  A = V  V −1    , where
  e 1   0   0
=    0   ...  0
   0    0  e

    and V is matrix that has the n eigenvectors as its columns.

  34.          x             x
limk→ ∞  x2fk xdx =  x2limk → ∞ fk xdx
        1             1 only if fk x converges uniformly over x1,x2
  35. A3 = I , has infinite number of A solutions. Think of A3    as 3 rotations, each of 1200    , going back to where we started. Each rotation around a straight line. Hence infinite number of solutions.
  36. How to integrate        ----
I =    √x3−1dx
       x .

    Let u = x3 + 1   , then du = 3x2dx and the above becomes

           √ --          √ --
I =   -u-du = 1   ---u-du
      3x3     3   u − 1

    Now let u = tan2v or √u-= tanv , hence 1√1du = sec2vdv
2 u and the above becomes


    But tan2v − 1 = sec2v hence


    Substituting back

        2 (√ --       (√-))
I = 3   u − arctan  u

    Substituting back

        2(√ ------       (√-----) )
I = 3   x3 + 1− arctan  x3 +1

  37. (added Nov. 4, 2015) Made small diagram to help me remember long division terms used.
  38. If a linear ODE is equidimensional, as in anxny n+ an−1xn−1y n01+...  for example x2y′′ − 2y = 0   then use ansatz y = xr    this will give equation in r only. Solve for r and obtain y  = xr1,y = xr2
 1       2     and the solution will be
    y = cy  +c y
    1 1   2 2

    For example, for the above ode, the solution is    2  c2
c1x + x    . This ansatz works only if ODE is equidimensional. So can’t use it on xy′′ +y = 0   for example.

    If r is multiple root, use xr,xr log x,xr log x2...  as solutions.

  39. for xi    , where i = √ −1   , write it as x = elogx    hence xi = eilogx = cos logx +i sin log x
  40. For Sturm Liouville, write it as py ′′ + p′y′ + q +   y = 0   . Now it is easy to compare to. For example, x′′ +  x = 0   , then we see that p = 1,q = 0, = 1   . And for xy′′ + x′ + xy = 0   then p = 1,q = 0,   = x . Do not divide by x first to re-write xy′′ + x′ + xy = 0   as y ′′ + 1x ′ + y = 0
     x   and then compare, as this makes it hard to compare to standard form and will make error.  Books write Sturm Liouville as d- py′+qy +     y = 0
dx   but I find this confusing to compare to this way. Better to expand it to    ′′   ′ ′
py  + p y + q +     y = 0   first so now it is in a more familiar form and can read it out more directly.
  41. Some integral tricks: √------
a2 − x2dx use x = asin . For √------
a2 + x2dx use x = a tan and for √------
x2 − a2dx use x = a sec .
  42. y ′′ + xny = 0   is called Emden-Fowler form.
  43. For second order ODE, boundary value problem, with eigenvalue (Sturm-Liouville), remember that having two boundary conditions is not enough to fully solve it. One boundary condition is used to find first constant of integration, second boundary condition is used to find the eigenvalues. We still need another input to find the second constant of integration. This is normally done by giving the initial value. This problem happens as part of initial value, boundary value problem. The point is, with boundary value and eigenvalue also present, we need 3 inputs to fully solve it. Two boundary conditions is not enough.
  44. If given ODE y′′ x+ p xy′ x+ q xy x = 0   and we are asked to classify singular at x = ∞ , then let x = 1
    t    and check what happens at t = 0   . The d2-
dx2     operator becomes (           )
 2t3-d +t4-d2
   dt    dt2  and ddx    operator becomes − t2 ddt    . And write the ode now where t is the independent variable, and follow standard operating procedures. i.e. look at limt→0 xp t and limt→0 x2q t and see if these are finite or not. To see how the operator are mapped, always start with     1
x = t    then write d-   d-dt
dx = dt dx    and write -d2   ( d-) ( d-)
dx2 =  dx   dx  . For example, -d     2 d
dx = −t dt    and

    Then the new ODE becomes


    The above is how the ODE will always become after the transformation. Remember to change p x  to p t using     1
x = t    and same for q x . Now the new p is  −p t+2t
---t2--     and the new q is q t
t4     . Then do        (     )
lim    t-p t +2t3
   t→0     t4     and lim    t2q t
  t→0   t4     as before.

  45. If the ODE a xy′′ +b x y′ + c x y = 0   ,  and say 0 ≤ x ≤ 1   , is either non-linear or linear, and there is essential singularity at either end, then use boundary layer or WKB. But Boundary layer method works on non-linear ODE’s (and also on linear ODE) and only if the boundary layer is at end of the domain, ie. at x = 0   or x = 1   . WKB method on the other hand, works only on linear ODE, but the singularity can be any where (i.e. inside the domain). As rule of thumb, if the ODE is linear, use WKB. If the ODE is non-linear, we must use boundary layer. Another difference, is that with boundary layer, we need to do matching phase at the interface between the boundary layer and the outer layer in order to find the constants of integrations. This can be tricky and is the hardest part of solving using boundary layer. Using WKB, no matching phase is needed. We apply the boundary conditions to the whole solution obtained. See my HWs for NE 548 for problems solved from Bender and Orszag text book.
  46. In numerical, to find if a scheme will converge, check that it is stable and also check that if it is consistent. It could also be conditionally stable, or unconditionally stable, or unstable. To check it is consistent, this is the same as finding the LTE (local truncation error) and checking that as the time step and the space step both go to zero, the LTE goes to zero. What is the LTE? You take the scheme and plug in the actual solution in it. An example is better to explain this part. Lets solve ut = uxx    . Using forward in time and centered difference in space, the numerical scheme (explicit) is

                   (               )
U nj+1= U nj +-k  Unj−1 − 2Unj +U nj+1

    The LTE is the difference between these two (error)

                (      k (               ))
LTE = Unj+1−  Ujn+ -2  Unj−1 − 2U nj +U nj+1

    Now plug-in u (tn,xj)  in place of U n
  j    and u (tn +k,xj)  in place of U n+1
 j    and plug-in u  tn,x + h in place of U n
  j+1    and plug-in u  tn,x − h in place of U n
  j−1    in the above. It becomes

            (       )  ( ( n   )  k  (             (n   )           ))
LT E =u  t + k,xj − u t ,xj + h2 u t,x − h − 2u t,xj + u t,x +h

    Where in the above k is the time step (also written as t ) and h is the space step size. Now comes the main trick. Expand the term   (n      )
u t  +k,xj  in Taylor,

                                |          |
  (n      )    ( n  )    ∂u-|   k2--∂2u-|     ( 3)
u t  +k,xj  =u  t ,xj  +k  ∂t||n + 2  ∂t2||n + O k
                            t          t

    And expand

                                |     2  2 |
u (tn,x + h) = u (tn,x) + h-∂u|| + h-∂-u-|| + O (h3)
      j             j     ∂x|xj   2 ∂x2 |xj

    And expand

                                |     2  2 |
u (tn,x − h) = u (tn,x) − h-∂u|| + h-∂-u-|| − O (h3)
      j             j     ∂x|xj   2 ∂x2 |xj

    Now plug-in (2,3,4) back into (1). Simplify, many things drop out, and we should obtain that

                  ( )
LT E =O  k+ O h2

    Which says that LTE →  0   as h →  0,k →  0   . Hence it is consistent. To check it is stable, use Von Neumann method for stability. This check if the solution at next time step does not become larger than the solution at the current time step. There can be condition for this. Such as it is stable if k ≤ h22    . This says that using this scheme, it will be stable as long as time step is smaller than h2
 2    . This makes the time step much smaller than space step.  

2 things to find out

  1. find out why I get different answers here. branch cut handling?

    Mathematica 9.01

    Clear[x, y, k1, k2, k3];
    sol = Solve[Cos[x - y] == k1 + k2*Cos[y] + k3*Cos[x], y]
    sol /. {x -> 1, k1 -> 2, k2 -> 3, k3 -> 4}


    {{y -> -2.81197 - 1.04751 I},
    {y -> 2.81197 + 1.04751 I},
    {y -> -2.81197 + 1.04751 I},
    {y -> 2.81197 - 1.04751 I}}

    Maple 18

    eq:= cos(x-y)=k1+k2*cos(y)+k3*cos(x):


    {2.811969896 - 1.047512037 I,
    2.811969896 + 1.047512037 I}

3 Converting first order ODE which is homogeneous to separable ODE

(Added July, 2017).

If the ODE M  x,y+ N x,y dy= 0
                dx   has both M and N homogenous functions of same power, then this ODE can be converted to separable. Here is an example. We want to solve

(3     2)   (   2   3) ′
x  +8x y  + 4xy  − y  y = 0

The above is homogenous in M, N , since the total powers of each term in them is 3   .

3       3        3      3
◜◞◟◝    ◜◞◟◝         ◜ ◞◟◝  ◜ ◞◟◝
x3  + 8x2y    +   4xy2  −  y3   y ′ = 0

So we look at each term in N and M and add all the powers on each x,y in them. All powers should add to same value, which is 3   in this case. Of course N ,M should be polynomials for this to work. So one should check that they are polynomials in x,y before starting this process. Once we check M, N are homogeneous, then we let

y = xv

Therefore now





 ′        ′
y = v +xv

Substituting (3,4,5) into (1) gives


Dividing by x3 ⇔ 0   it simplifies to

        (  3   4)    (  2   3) ′
1+ 8v+  4v − v  + x  4v  − v  v = 0

Which can be written as


We see that it is now separable. We now solve this for v x  by direct integration of both sides And then using y = xv find y x .

4 Direct solving of some simple PDE’s

Some simple PDE’s can be solved by direct integration, here are few examples.

Example 1

∂z x,y
------- = 0

Integrate w.r.t. x ., and remember that now constant of integration will be function of y , hence

z x,y= f y

Example 2

∂z-x,y-= x

Integrating once w.r.t. x gives

∂z x,y   x2
------- = -- + f y
  ∂x      2

Integrating again gives

z x,y=  6-+ xf y +g y

Example 3

∂-z x,y-= y

Integrating once w.r.t. y gives

∂z x,y   y2
------- = -- + f x
   ∂y     2

Integrating again gives

z x,y=  6-+ yf x +g x

Example 4

∂-z-x,y-= 0
 ∂x ∂y

Integrating once w.r.t x gives

 ∂z x,y 
------- = f y

Integrating again w.r.t. y gives

z x,y =   f y dy + g x

Example 5

Solve u + u  = 0
 t   x   with u x,1 = -x--
         1+x2     . Let u ≡ u x t ,t , therefore

du-= ∂u-+ ∂udx--
dt   ∂t   ∂x dt

Comparing the above with the given PDE, we see that if ddxt = 1   then ddut = 0   or u x t,t is constant. At t = 1   we are given that

u = --x-1-2
    1+ x 1

To find x 1 , from dx = 1
 dt   we obtain that x t= t +c . At t = 1   , c = x 1− 1   . Hence x t= t +x  1− 1   or

x 1 = x t + 1 − t

Hence solution from (1) becomes

u = --x-−-t +-1--
    1+ x − t + 12

Example 6

Solve ut + ux +u2 = 0   .

Let u ≡ u x t,t , therefore

du-= ∂u-+ ∂udx--
dt   ∂t   ∂x dt

Comparing the above with the given PDE, we see that if dx
dt = 1   then du-     2
dt = −u    or −1
 u = −t + c.  Hence

u = t + c

At t = 0   , c = u-x1 0,0 . Let u x 0,0= f x  0 . Therefore

u = -----1---
    t + f x 0

Now we need to find x 0 . From ddxt = 1   , then x = t +c or c = x 0 , hence x  0= x − t and the above becomes

            1        f x − t
u x,t= -----1---= ------------
        t + f x−t  tf x − t +1

5 Fourier series flow chart

(added Oct. 20, 2016).


6 Linear combination of two solution is solution to ODE

If y1,y2    are two solutions to ay′′ +by ′ +cy = 0   then to show that c1y1 + c2y2    is also solution:


Multiply the first ODE by c1    and second ODE by c2


Add the above two equations, using linearity of differentials

a c1y1 +c2y2′′ + b c1y1 + c2y2′ + c c1y1 + c2y2= 0

Therefore c1y1 + c2y2    satisfies the original ODE. Hence solution.

7 To find the Wronskian ODE


        |     |
W  x = ||y1  y2||=y y ′− y y′
        |y′1  y′2|   12   2 1

Where y1,y2    are two solutions to ay′′ +by′ + cy = 0.  Write


Multiply the first ODE above by y2    and the second by y1


Subtract the second from the first

  (  ′′     ′′)    (  ′     ′)
a y2y1 − y1y 2 + p y2y1 − y1y2 = 0


  (   ′    ′)
p  y2y1 − y1y2 = −pW



Substituting (2,3) into (1) gives the Wronskian differential equation


Whose solution is

W   x= Ce−  adx

Where C is constant of integration.

Remember: W  x0 = 0   does not mean the two functions are linearly dependent. The functions can still be Linearly independent on other interval, It just means x
 0    can’t be in the domain of the solution for two functions to be solutions. However, if the two functions are linearly dependent, then this implies W  = 0   everywhere.  So to check if two functions are L.D., need to show that W  = 0   everywhere.

8 Sturm-Liouville Notes

8.1 Definitions

Regular Sturm-Liouville ODE is an eigenvalue boundary value ODE. This means, it the ODE has an eigenvalue in it  , where solution exists only for specific values of eigenvalues. The ODE is

   (       )
-d- p x dy- +q x y x+      xy x = 0    a < x < b
dx       dx


py′′ + p′y′ + q +   y = 0

With the restrictions that p x,q x,    x are real functions that are continuous everywhere over a ≤ x ≤ b and also we need p x> 0,   x > 0   . The x is called the weight function. But this is not all. The boundary conditions must be linear homogeneous, of this form


Where i    are just real constants. Some of them can be zero but not all. For example, 1y a = 0, 4y′ b= 0   is OK. So boundary conditions do not have to mixed, but they can be in general. But they must be homogeneous. Notice that periodic boundary conditions are not allowed. Well, they are allowed, but then the problem is no longer called Sturm-Liouville. The above is just the definition of the equation and its boundary conditions. Below is list of the important properties of this ODE. Each one of these properties have a proof.

  1. All eigenvalues 
n    are real. No complex 
n    .
  2. Each eigenvalue n    , will have one and only one real eigenfunction n x associated with it. (in higher dimensions, the eigenvalue problem can have more than one eigenfunction associated with one eigenvalue. For example, heat PDE in rectangle). But we can use Gram-schmidt to make these eigenfunctions linearly independent if we need to). But for exam, just worry about 1D for now.
  3. There is smallest  , called 1    and there are infinite number of eigenvalues. Hence eigenvalues are ordered 1 <   2 <  3⋅⋅⋅
  4. Each eigenfunction n x will have n − 1   zeros in a < x < b . Note that this does not include the end points. This means, 3 x will have two zeros inside the domain. i.e. there are two x locations where 3 x= 0   in a < x < b .
  5. Eigenfunctions n x make a complete set of basis. This means any piecewise continuous function f x can be represented as f x∼    ∞  an  n x 
        n=1 . This is called the generalized Fourier series.
  6. This follows from (5). Each eigenfunction is orthogonal to another eigenfunction, but with the weight in there. This means 
 b  n x   m x    xdx = 0
a   if n ⇔ m .
  7. Rayleigh quotient relates an eigenvalue to its eigenfunction. Starting with the SL ODE p   ′ ′ + q = − , then multiplying by and integrating, we obtain  b     ′′    2
a    p   + q   dx =       b    2
− a         dx and solving for  , gives
       b     ′′    2
=  a -- -p----+q---dx-
     b  2   x dx

    Carrying integration by parts on first part of the integral in numerator, it becomes

           ′a     b   ′ 2    2
  −p ----b-+-a-p-----−-q---dx
=           b 2
         a       x dx

    But this becomes much simpler when we plug-in the boundary conditions that we must use, making the above

   bp   ′2 − q 2dx
= -a --b-----------
    a   2   x dx

    And if q = 0,p = 1   and x = 1   , then it becomes     bffi′ 2dx
=  a-bffi2dx-.
   a  Rayleigh quotient is useful to show that  can be positive without solving for  and also used to estimate the value of the minimum  by replacing  by a trial function and actually solving for  using    b   ′′   2
= a-ffi-bpffi-+qffidx-
   a ffi2ff xdx    to obtain a numerical estimate of 1    .

  8. There is symmetry relation. In operator form, let        (      )
L ≡ -d p  xd-  +q x 
    dx      dx , then we have  b
a uL v
− vLu
dx = 0   where u,v are any two different eigenfunctions.
  9. There is also what is called Lagrange identity, which says uLv
− vL u 
= ddx p uv ′ − vu ′ . This comes from simply expanding L v
,Lu
 and simplifying things. Just calculus.
  10. Green formula, follows from Lagrange identity, which just gives the integral form   b
a uL v
 − vL u 
dx = p uv′ − vu ′
 ba    . But form Sturm-Liouville, we know that  buL v
− vLu
dx = 0
a   (from 8). So this really just says that p uv′ − vu′
b = 0
            a   . But we know this already from boundary conditions. So I am not sure why this is useful for Sturm-Liouville now. Since it is just saying the same thing again.
  11. If p  x= 0   at the left or right end (boundaries), then the problem is now called singular Sturm-Liouville. This is actually the important case. In regular S.L., p x must be positive everywhere. We only consider p x= 0   at the ends, not in the middle or any other place. When this happens, then solutions to the ODE generate special functions, such as Bessel, Legendre, Chebyshev. These special functions are solution to the Singular S.L. ODE, not the regular S.L. ODE. In addition, at the end where p  x= 0   , the boundary conditions must be bounded. Not the regular boundary conditions above. For example, if p a = 0   where x = a is say the left end, the the boundary conditions at the left end must be y a < ∞,y ′ a < ∞ . Notice that singular can be at left end, right end or at both at same time. At the end where p x = 0   , the boundary conditions must be bounded type.
  12. Symmetry relation
  uL v
 − vL u 
dx = 0

    where u,v are are two eigenfunction. But remember, this does NOT mean the integrand is identically zero, or uL v
− vLu
= 0   . Only when u,v happened to have same eigenvalue, only then we can say that uL v
 − vL u 
= 0   . But for unique eigenfunctions only the integral form is zero  b
a uL v
− vLu
dx = 0.  This is important difference, used in the proof below, that different eigenfunctions have different eigenvalues (for the scalar SL case). For higher dimensions, we can have more than one eigenfunction for same eigenvalue.

Proofs for these properties follows

8.2 Proof symmetry of operator

Given regular Sturm Liouville (RSL) ODE


In operator form

       (       )
L ≡ -d- p x-d- + q x
    dx      dx

And (1) becomes

L y 
= −    y

When solving RSL ode, since it is an eigenvalue ODE with associated boundary conditions, we will get infinite number of non-negative eigenvalues. For each eigenvalue, there is associated with it one eigenfunction (for 1D case). Looking at any two different eigenfunctions, say u,v , then the symmetry relation says the following

  b              b
  uL v 
dx =    vL u
a             a

Now we will show the above is true. This requires integration by parts two times. We start from the LHS expression and at the end we should end up with the integral on the RHS. Let       b
I = a uL v
dx , then


Where we used        ( d ( d )   )    ( d ( d  )     )
Lv
 =  dx pdx  + q v =  dx  pdxv  +qv  in the above. Hence

         ◜------◞(◟ ------◝)
      b -d-  dv-        b
I =    udx  pdx  dx +    qvudx
    a                 a

Now we will do integration by parts on the first integral. Let       b    (    )
I2 =  a u ddx pdvdx dx . Using 
U dV = UV −   VdU , and if we let            d-( dv-)
U = u,dV  = dx p dx  , then       du-      dv
dU =  dx ,V  =p dx    . Hence

                   dV    U
     (    )b      b◜◞ ◟◝◜◞◟◝
I2 = up dv-  −     dv- p du-dx
        dx a    a  dx    dx

We now apply integration by parts again to the second integral above. But now let U = pddux    and dV = dv
     dx    , hence         (   )
dU = -d  pdu
     dx   dx  and V = v , therefore the above becomes

    (     )   [(     )            (    )  ]
       dv- b     du-  b    b  -d-  du-
I2 = updx  a −  pdx v a − a  vdx  pdx  dx

Substituting the above into (2) gives

             [                          ]
   (  dv )b   ( du  )b     b   d ( du )         b
I = up---  −   p---v  −     v--- p--- dx  +    qvudx
      dx  a     dx   a   a   dx   dx         a

Now comes the part where the boundary conditions are important. In RSL, the boundary conditions are such that all the terms (  dv)b  ( du  )b
up dx a , p dxv a    vanish. This is because the boundary conditions are




So now (3) becomes


Therefore, we showed that   b             b
a uL v 
dx = a vL u
dx . The only thing to watch for here, is which term to make U and which to make dV when making integration by parts. To remember, always start with uLv
 and make dV the term with ddvx    in them during integration parts.

8.3 Proof eigenvalues are real

Assume  is complex. The corresponding eigenfunctions are complex also. Let  be corresponding eigenfunction

= −

Taking complex conjugate of both sides, and since  is real, we obtain

----   -- --
= −

But ----    [-]
 = L  since all the coefficients of the ODE are real. The above becomes

  [-]    ----
L     = −

But by symmetry, we know that

  b   [ ]
    L   -- −  -L  
dx = 0

Substituting  (1),(2) into the above gives


But --
     = |  | which is positive. Also the weight x  is positive by definition. Hence for the above to zero, it must be that  --
= . Which means  is real. QED.

So the main tools to use in this proof: Definition of L  
 = −      ,  and symmetry relation and that ----    [-]
= L  . This might come up in the exam.

8.4 Proof eigenfunctions are unique

Now will show that there is one eigenfunction associated with each eigenvalue (Again, this is for 1D, it is possible to get more than one eigenfunction for same eigenvalue for 2D, as mentioned earlier).  By contradiction, assume that  has two eigenfunctions   ,
1  2    associated with it. Hence


From the first equation,  −L ffi1
=  ffffi1     , substituting this into the second equation gives


By Lagrange identity,                 -d ( (   ′     ′))
1L  2
−   2L  1
= dx  p   1  2 −  2  1  , hence this means that


Where c1    is some constant. This is the main difference between the above argument, and between Lagrange identity. This can be confusing. So let me talk more about this. In Lagrange identity, we write

1L  2
−   2L  1
= -d-(p (  1 ′ −   2  ′))
                dx       2      1

And when 2,  1    also satisfy the SL boundary condition, only then we say that


But the above is not the same as saying 1L  2
−   2L  1
= 0   . This is important to keep in mind. Only the integral form is zero for any two functions with the SL B.C..Now we continue. We showed that  (          )
p   1  ′2 −  2  ′1  =c . In SL, this constant is zero due to B.C. Hence

 (   ′     ′)
p   1  2 −  2  1  = 0

But p > 0   by definition. Hence  ′     ′
1  2 −  2  1 = 0   or


or 2 =c2  1    . So the eigenfunctions are linearly dependent. One is just scaled version of the other. But eigenfunction must be linearly independent. Hence assumption is not valid, and there can not be two linearly independent eigenfunctions for same eigenvalue. Notice also that  ′     ′
1  2 −  2  1 = 0   is the just the Wronskian. When it is zero, we know the functions are linearly dependent. The important part in the above proof, is that  ′     ′
1  2 −  2  1 = 0   only when 1,  2    happened to have same eigenvalue.

8.5 Proof eigenfunctions are real

The idea of this proof is to assume the eigenfunction is complex, then show that its real part and its complex part both satisfy the ODE and the boundary conditions. But since they are both use the same eigenvalue, then the real part and the complex part must be linearly dependent. This implies the eigenfunction must be real. (think of Argand diagram)

Assume that = U + iV is complex eigenfunction with real part U and complex part V . Then since

= −

The above is just writing the Sturm-Liouville ODE in operator form, where L is the operator as above.  Now we have

LU + iV 
= −      U + iV

By linearity of operator L

L U 
+ iL V
= −     U − i    V

Which implies


So we showed that the real and complex part satisfy S.L. Now we need to show they are satisfy S.L. boundary conditions. Since


Where x = a,x = b are the left and right ends of the domain. Then




So the above means both U and V satisfy the boundary conditions of S.L. But since both U,V have the same eigenvalue, then they must be linearly dependent, since we know with S.L. each eigenfunction (or one linearly dependent to it) have only one eigenvalue. This means

V = cU

Where c is some constant. In other words,


Where c0    is new constant. (OK it happens to be complex constant, but it is OK to do so, we always do this trick in other places, if it will make me feel better, I could take the magnitude of the constant). So all what the above says, is that we assumed  to be complex, and found that it is real. So it can’t be complex.

8.6 Proof eigenfunctions are orthogonal with weight

Given two different eigenfunctions   ,
1  2    . Hence


From symmetry integral relation, since these eigenfunctions also satisfy S.L. boundary conditions, we can write

    1L  2
−   2L  1
dx = 0

Replacing (1,2) into the above


But 1 −  2⇔ 0   since there are different eigenvalues for different eigenfunctions. Hence

       1 x  2 xdx = 0

Which means 1,  2    are orthogonal to each others with weight x

8.7 Special functions generated from solving singular Sturm-Liouville

When S.L. is singular, meaning p = 0   at one or both ends, we end with important class of ODE’s, whose solutions are special functions (not sin   or cos   ) as the case with the regular S.L. Recall that S.L. is


With the regular S.L., we say that p x> 0   over the whole domain, including end points. But with singular, this is not the case.  Here are three important S.L. ODE’s that are singular.

Bessel equation:

  ′′   ′′
xy  +x y  = −    y    0 < x < 1

Or in standard form

xy′ ′ = −    y

So we see that p x= x,q x= 0,   x = x . We see that at x = 0   , then p x = 0   , which what makes it singular. (also the weight happens to be zero also), but we only care about p being zero or not, at one of the ends. So to check if S.L. is singular or not, just need to check if p x is zero or not at one of the ends. As mentioned before, when p = 0   at one of the ends, we can’t use the standard B.C. for the regular S.L. instead, at the end where p = 0   , we must use what is called bounded boundary conditions, which is in this case y 0 < ∞,y′ 0< ∞ . The solution to this ODE will be in terms of Bessel functions. Notice that this happened to be singular, due to the domain starting at x = 0   . If the domain happened to be from x = 0.5   to say x = 1   , then it is no longer singular S.L. but regular one.

Legendre equation

((     )  )
  1− x2 y ′′ = −  y    0 < x < 1

Or it can be written as

(    2)  ′′     ′
1 − x  y − 2xy  = −  y

We see that p = 1− x2    . And now it happens to be that p = 0   at x = 1   . So it is singular at the other end compared to Bessel. In this case q = 0,  = 1   . Again, the boundary conditions at x = 1   must now be bounded. i.e. y  1< ∞,y ′ 1 < ∞ . On the other end, where p is not zero, we still use the standard boundary conditions  1y 0 +  2y′ 0= 0   .

Chebyshev equation

(√ ------) ′    √------
   1− x2y′  = −   1 − x2y     − 1 < x < 1


√ ----2-′′  1--2x--- ′    √ ----2-
  1− x y  − 2√1-−-x2y = −    1− x y

So we see that    √ ------         √------
p =  1− x2,q = 0,  =  1 − x2    . So where is p = 0   here? At x = − 1   then p = √1-− 1-= 0   and at x = −1   then p = 0   also. So this is singular at both ends. So we need to use bounded boundary conditions at both ends now to solve this.


The solution to this singular S.L. is given in terms of special function Chebyshev.

8.8 Some notes on S-L

  1. Solution to xy′′ +y ′ = − xy . Set in S-L form, we get py′ ′ + qy = −   y , then p = x,q = 0,  = x . For a < x < b with y b = 0,y a < ∞ . This is singular S-L. Using Frobenius series, the solution comes out to be           ∞      (√ ---)
y  x=   n=1cn J0     nx  , where J0    is Bessel function of first kind. √  ---
   n    are roots of      (    )
0 = J √  -b
    0  . (check if this should be       (√--     )
0 = J0     b − a  . cn    is constant, which still needs to be found.
  2. We can find approximate solution to S-L ODE for large eigenvalue, without solving the ODE using series method, but by using WKB. Given py ′ ′ + qy = −  y which is standard form S-L. Using physical optics correction, we obtain the solution as
                       (     (      ∘ ----- )        (      ∘ ----- ))
                 −1-       √--   x      t           √- -  x      t
y x ∼     xp x4  c1cos          ----dt  + c1sin           ----dt
                              0   p  t                0   p t

    Where c ,c
 1 2    are found from boundary conditions now. The above is valid for "large"  , and is found by first letting and then assuming y x= exp (1"   ∞n=0. And working though the WKB method. Remember, WKB only works for linear homogeneous ODE and is used to estimate solution for large  (or small ).

  3. in 1D, S-L is py ′ ′ + qy = −  y with 0 < x < L , with B.C. given by 1y 0+   2y′ 0 = 0   and 1y L+   2y′ L = 0,  and in higher dimensions, this problem becomes ∇ ⋅ p∇  + q  = − with boundary conditions now written as 1  +   2 ∇   ⋅ ^n= 0   on boundary    . In higher dimensions, ≡     x .
  4. Lagrange’s identity in 1D is
                      ( (          ))
uL v− vL u = d--p  udv-− vdu-
               dx     dx    dx

    where L above is the S-L operator, as in     -d ( d-)
L ≡ dx p dx  +q so that we write L u= −     y .  

  5. When it says eigenfunctions are normalized, it should mean that    b  2
a    n x   x dx = 1   where x is the weight function (comes from SL equation) and a < x < b is the domain. This is for 1D SL. Remember for example, that        (√--- )
 Lcos2    nx  dx = L2
0    . where √ ---
    n = nLı-    . Here, it is not normalized since the result is not one. The weight here is just one and here         (nı- )
n x= cos  L x
  6. Heat PDE in cylinder is ∂T r,z,„,t   (∂2T-  1∂T-  ∂2T   1-∂2T)
   ∂t   = k  ∂r2 + r ∂r + ∂z2 + r2 ∂„2 .  For steady state, it becomes, say cylinder has length L

    And for axis symmetric (no  dependency), it becomes

         2        ∂2T-  1 ∂T-  ∂2T-
0 = ∇ T r,z=  ∂r2 + r ∂r + ∂z2

    The solution is found by separation of variables. For the above is in terms of Bessel function order zero

              ∞    (∘ -- ) (  √ –z      √–-z)
T r,z=     J0     0r  Ane  n  +Bne   n

    which is found using series method T =    ∞n=0 anrn+¸    . The eigenvalues n    are zeros of       (√-- )
0 = J0    r0  where r0    is disk radius.

  7. Heat PDE in 2D polar is -∂T-r,„,t    (∂2T   1∂T-  1-∂2T-)
   ∂t   = k  ∂r2 + r∂r + r2∂„2  and for steady state it becomes 0 = ∂2T2 + 1∂T-+ 12-∂2T2
    ∂r    r∂r   r ∂„     . After separating, the r ODE becomes Euler ODE. Use guess R  r= rp    for solution. The solution will be
                   ∞            n    ∞           n
T  r,z = A0 +   An cos n  r +    Bn sin n  r
             n=1              n=1

  8. To change p  xy′′ + q xy ′ +r xy x= 0   to S.L. form, multiply the ODE by         (       )
= -1- exp    q xdx
 p x      p x
  9. In using method of eigenfunction expansion to solve nonhomogeneous B.C. PDE, the eigenfunction used n x are the ones from solving the PDE with homogeneous B.C. For example, if given ∂u    ∂2u-
∂t =k ∂x2     with nonhomogeneous B.C. , say u 0,t= A,u L,t= B , and we want to use            ∞
u  x,t ∼  n=1bn t  n x  on the nonhomogeneous B.C. PDE, then those n  x are from the corresponding homogeneous B.C. PDE. They should be something like    (    )
sin nıL x  and so on. See the table at the top. That is why we write ∼ above and not =   and remember not to do term-by-term differentiation in x when this is the case. We can do term-by-term differentiation only of the PDE itself also has homogeneous B.C. , even if it had a source term there also. The point is,    x
n always come from solving the nonhomogeneous B.C. PDE. (This normally means solving a Sturm-Liouville ODE).
  10. I found new relation for eigenvalue  , but it is probably not useful as the one the book has. Here how to derive it
    Given S-L
       (    )
-d- pdy-- +qy = −    y
dx   dx

    Let n x be the eigenfunction associated with eigenvalue n    . Since eigenfunctions satisfy the ODE itself, then we can write, for any arbitrary eigenfunction (subscript removed for clarity in what follows)


    Integrating both sides

      b            b           b              b
   p′  ′dx +    p   ′′dx +    q  dx = −          dx
a           a          a             a

    Looking at   b
a p   ′′dx . Integrating by part. Let u = p,dv =   ′′ → du = p′,v =   ′ , hence


    Substituting (3) into (2) gives



         ′ b    b
   p --
a +-a-q--dx-
= −       b    dx

    compare to the one in the book           b
= −pffiffi′ ab-+-a p ffi′ 2−qffi2dx
      abffi2ff xdx    .

  11. This is about using eigenfunction expansion to approximate f x . Given the eigenfunctions n x found by solving SL Ly
=   ry problem, then these eigenfunctions are called complete, in the mean sense, if  ∞
n=1bn  n x converges to the mean of f  x at each point in a ≤ x ≤ b . This requires only that f x  be square integrable over the domain. We can also say that the eigenfunctions are complete, in pointwise sense, if 
∞n=1bn   n x converges to f x at each point in a ≤ x ≤ b . This however requires that f x be continuous in a ≤ x ≤ b and f ′ x  be piecewise continuous. So clearly the first case is less strict than the second. This means if f x is only square integrable (and not necessarily continuos), then convergence to the mean of f x at each point can be obtained. To obtain the more strict point wise convergence, more restrictions are needed on f  x as mentioned above. Clearly point wise convergence implies convergence in the mean. But not the other way around.

8.9 Methods of solutions to some problems

Problem 1 If the problem gives S-L equations, and asks to find estimate on the smallest eigenvalue, then use Rayleigh quotient for  . And write              ′a   b   ′2  2
min =  1 ≤ −pffiffi-b-+b-a p-ffi−qffi-dx
              a ffi2ff xdx    . Now we do not need to solve the SL to find solution  , this is the whole point. Everything in RHS is given, except for, of course the solution  . Here comes the main idea: Come up with any trial trial    and use it in place of  . This trial function just needs to satisfy the boundary conditions, which is also given. Then all what we need to do is just evaluate the integral. Pick the simplest trial    function of x which satisfies boundary conditions. All other terms p,q, we can read from the given problem. At the end, we should get a numerical value for the integral. This is the upper limit of the lowest eigenvalue 1

Problem 2 We are given SL problem with boundary conditions, and asked to show that ≥ 0   without solving the ODE to find  : Use Rayleigh quotient and argue that the denominator can’t be zero (else eigenfunction is zero, which is not possible) and it also can’t be negative. Argue about the numerator not being negative. Do not solve the ODE! this is the whole point of using Rayleigh quotient.

Problem We are given an SL problem with boundary conditions and asked to estimate large  and corresponding eigenfunction. This is different from being asked to estimate the smallest eigenvalue, where we use Rayleigh quotient and trial function. In this one, we instead use WKB. For 1D, just use what is called the physical optics correction method, given by           (     (            )        (            ))
        −1-       √--  x∘ ff-t          √ - - x∘ ff-t-
x ≈    p4  c1 cos     0   p-t dt  + c2 sin      0   p-t dt  . Where ,p in this are all known functions (from the problem itself). Notice that q is not there. Now use the boundary conditions and solve for one of the constants, should come to be zero. Use the second boundary condition and (remember, the boundary conditions are homogenous), and we get one equation in  and for non-trivial solution, solve for allowed values of n    . This gives the large eigenvalue estimate. i.e. for n    when n is very large. Depending on problem, n does not have be too large to get good accurate estimate compared with exact solution. See HW7, problem 5.9.2 for example.

Problem We are given 1D heat ODE ∂u = k∂2u
∂t    ∂x2     . If B.C. are homogenous, we are done. We know the solution. Separation of variables. If the B.C. is not homogenous, then we have a small problem. We can’t do separation of variables. If you can find equilibrium solution, ur x , where this solution only needs to satisfy the non-homogenous B.C. then write solution as u x,t= v x,t+ur x  , and plug this in the PDE. This will give        2
∂∂vt-= k∂∂xv2     but this now satisfies the homogenous B.C. This is the case if the original non-homogenous B.C were Dirichlet. If they were Neumann, then we will get -∂v∂t = k ∂∂2vx2 + extra where extra is extra term that do not vanish because ur  x do not vanish in this case after double differentiating. We solve for v  x,t now, since it has homogenous B.C., but it has extra term there, which we treat as new source. We apply initial conditions now also to final all the eigenfunction expansion coefficients. We are done. Now we know u x,t.

But if we can find the reference function ur    or we do not want to use this method, then we can use another method, called eigenfunction expansion. We assume 
u x,t=   ∞n=1an t  n x and plug this in the PDE.  But now we can’t do term by term differentiation, since n    are the eigenfunctions that satisfies the homogenous B.C., while u x,t has non-homogenous B.C. So the trick is to use Green formula, to re-writes   L ∂2u    
0 k∂x2  n x dx as   L   ∂2ffin-x
0 ku  ∂x2 dx plus the contribution from the boundary terms. (this is like doing integration by parts twice, but much easier). Now rewrite   2  
-∂ffi∂nx2x = −  n  n x . See page 355-356, Haberman.

8.10 Examples converting second order linear ODE to Sturm-Liouville

Any linear second order ODE can be converted to S-L form. The S-L form is the following

(    dy )′
 p x---  − q xy x+   r xy x = 0

Some books write the above with a plus sign + q xy x  there (for example, Heberman) and some with − q x y x for example Boyce/DiPrima. I really never understood why some put a minus sign and some do not. May be I’ll find out one day. But for now will use (1) is used as the S-L form. If your book uses the plus sign, the same process works, just flip the sign.

The goal now, is given any general form eigenvalue ODE, we want to convert it (rewrite it) in the above form. The second order linear ODE will have this form

a xy′′ + b xy′ + c x+  y = 0

The parameter  in the S-L form, is the eigenvalue. We really only use S-L form for eigenvalue problems, but if the  is missing, then the form of the input will be

a x y′′ + b xy ′ +c xy = 0

And the process will work the same way, where = 0   in all the following.

First, we will show how to convert (2) to (1), and then show few examples. We are given (2), and want to convert it to (1). The first step is to convert (2) to standard form

y′′ + b-xy′ + c-x+--y = 0
    a  x     a x

Then multiply the above by some unknown x which is assumed positive. This is called the integrating factor

           b x        c x+
x y′′ +   x ----y′ +   x-------y = 0
           a x          a x

Now, rewrite   ′′
xy as    ′′   ′    ′
xy  −    x y in the above


Here comes the main trick in all of this. We want to force x ba x x−  ′ x to be zero. This implies        b xdx
x = e ax    . (this comes from just solving the ODE  b-x   ′
xa x−     x= 0   . Therefore, if 
x = e ba xxdx    , then (3) becomes


We are done. Compare (4) to the SL form

(       )′
 p xdy-  − q xy x+   r xy x = 0

 We see that


Let see how this works on some examples

8.10.1 Example 1

Convert y′′ +  y = 0   to S-L.  We see the general form here is a xy ′′ + b xy′ + c x+  y = 0   , with a = 1,b = 0,c = 0   . Hence        b xdx
x = e ax  = 1   , therefore the SL form is

(       )′
 p xdy-  − q xy x+   r xy x = 0



Hence (1) becomes

y +   y x = 0

This was easy, since the ODE given was already in SL form.

8.10.2 Example 2


 2 ′′    ′  ( 22    2)
x y  + xy +    x  − n  y = 0

Rewrite as

          (      )
 ′′   1-′      n2
y  +x y +    − x2  y = 0

The general form here is a xy′′ +b x y ′ + c x+ y = 0   , with a = 1,b = 1,c = −n2
         x      x2     . Hence

       b x
x= e  ax dx = e 1xdx = elnx = x

Therefore the SL form is

(       )′
 p xdx   − q xy x+   r xy x = 0



Therefore (1) becomes

(    )′   2
 xdy-  − n-y x+   xy x = 0
  dx     x

8.10.3 Example 3


 ′′  -2x---′   ------
y  − 1− x2y + 1 − x2y = 0

Rewrite as

(    2) ′′      ′
1 − x  y  − 2xy +  y = 0

The general form here is a x y′′ + b xy′ + c x+  y = 0   , with     (    )
a = 1 − x2 ,b = −2x,c = 0   . Hence

       bdx   −   -2x-dx   − −ln x−1 −ln x+1  lnx−1 lnx+1                 2
x= e  a  = e   1−x2  = e               = e     e     =  x − 1 x + 1= x − 1

Therefore the SL form is

(       )′
 p xdy-  − q xy x+   r xy x = 0



Hence (1) becomes

((2    ) dy )′
 x  − 1 dx-  +  y  x= 0

8.10.4 Example 4


(    2) ′′    ′
 1− x  y  − xy +   y = 0

We see the general form here is a x y′′ + b xy ′ + c x+ y = 0   , with     (    )
a = 1 − x2 ,b = −x,c = 0   . Hence

       bdx   −   -x-dx
x= e  a  = e   1−x2

But               |     |
1x−x2dx = − 12 ln|1− x2|  , therefore     1 ||  2||  √------
x= e2ln1−x  =  1 − x2    . Therefore the SL form is

(       )′
 p xdy-  − q xy x+   r xy x = 0



Hence (1) becomes

  1 − x2--- + √------y x = 0
        dx      1− x2

8.10.5 Example 5


a x y′′ + b xy ′ +c xy = 0

The general form here is a x y ′′ + b xy′ + c x+  y = 0   , with a = a x ,b = b x,c = c x and = 0   . Hence

x = e badx

Therefore the SL form is

(       )
     dy- ′
 p xdx   − q xy x+   r xy x = 0



Hence (1) becomes

(   b    )′     b  c x
e  adxy′  + e adx----y x = 0
                 a x

We see that the above is the same as a xy′′ + b xy′ + c xy = 0   becuase

(        )′
 e badxy′  = b-e badxy′ + e badxy ′′

And hence (2) becomes

b-   badx ′      badx  ′′     badx c-x
a e    y + e    y  + e     a xy x= 0

e  badx    gives


Which is the original ODE.

8.10.6 Example 5


3y′′ + 2y′ + 5y = 0

The general form here is a x y ′′ + b xy′ + c x+  y = 0   , with a = 3,b = 2,c = 5   and = 0   . Hence

x= e  badx = e 23dx = e23x

Therefore the SL form is

(       )
     dy- ′
 p xdx   − q xy x+   r xy x = 0



Hence (1) becomes

( 2   )′   2 (5 )
 e3xy′  + e3x -  y x= 0

The above is the same as 3y′′ + 2y ′ + 5y = 0   becuase

( 2  )′     2      2
e 3xy ′ =  2e3xy′ + e3xy′′

And hence (2) becomes

                   (  )
2  23x  ′  23x ′′   23x 5
3e  y  +e  y  + e   2  y x= 0

Cancelinge23x    gives


Which is the original ODE.

9 Green functions notes

  1. Green function is what is called impulse response in control. But it is more general, and can be used for solving PDE also. Given a differential equation with some forcing function on the right side. To solve this, we replace the forcing function with an impulse. The solution of the DE now is called the impulse response, which is the Green’s function of the differential equation. Now to find the solution to the original problem with the original forcing function, we just convolve the Green function with the original forcing function. Here is an example. Suppose we want to solve   Ly t
= f t with zero initial conditions. Then we solve L g t
=    t . The solution is g t . Now y t= g t⊛ f t . This is for initial value problem.  For example. y ′ t+ kx = eat    , with y  0= 0   . Then we solve g′ t+ kg =   t . The solution is        { e−kt t > 0
g t =
          0   t < 0   , this is for causal system. Hence y t= g t⊛ f t . The nice thing here, is that once we find g t , we can solve   ′
y  t+ kx = f t for any f t by just convolving the Green function (impulse response) with the new f t .
  2. We can think of Green function as an inverse operator. Given Ly t
 = f t , we want to find solution 
y  t = −∞∞ G t;  f    d . So in a sense, G  t;   is like L−1y t
  3. Need to add notes for Green function for Sturm-Liouville boundary value ODE. Need to be clear on what boundary conditions to use. What is B.C. is not homogeneous?
  4. Green function properties:

    1. G t;   is continuous at t = . This is where the impulse is located.
    2. The derivative G′ t just before t = is not the same as G′ t just after t = . i.e.   ′          ′
G  t;t − . This means there is discontinuity in derivative.
    3. G t;   should satisfy same boundary conditions as original PDE or ODE (this is for Sturm-Liouville or boundary value problems).
    4. LG  t;  
= 0   for t ⇔
    5. G x;    is symmetric. i.e. G  x;  = G    ;x  .
  5. When solving for G t;   , in context of 1D, hence two boundary conditions, one at each end, and second order ODE (Sturm-Liouville), we now get two solutions, one for t < and one for t > . So we have 4   constants of integrations to find (this is for second order ODE) not just two constants as normally one would get , since now we have 2 different solutions. Two of these constants from the two boundary conditions, and two more come from property of Green function as mentioned above.          {
           A1y1 + A2y2 t < 0
G  t;  =   A3y1 + A4y2 t > 0

10 Laplace transform notes

  1. Remember that uc tf t − c⇐ ⇒ e−csF s and uc tf t⇐ ⇒ e−csℒ {f t +c} . For example, if we are given u2 tt , then             −2s           −2s (-1 2)    −2s (1+2s)
ℒ  u2 tt= e  ℒ {t + 2} = e  s2 + s = e     s2  . Do not do uc tf t ⇐⇒  e−csℒ {f t} ! That will be a big error. We use this allot when asked to write a piecewise function using Heaviside functions.

11 Some CAS notes

  1. in Mathematica Exp is a symbol. Head[Exp] gives Symbol but in Maple it is not. In Maple

    gives {  ,  ,x,y,z} but in Mathematica


    gives {e,x,   ,z,Gamma,y  }

Notice that e shows up in Mathematica, but not in Maple.

12 References

Too many references used, but will try to remember to start recording books used from now on. Here is current list

  1. Applied partial differential equation, by Haberman
  2. Advanced Mathematical Methods for Scientists and Engineers, Bender and Orszag, Springer.
  3. Boundary value problems in physics and engineering, Frank Chorlton, Van Norstrand, 1969
  4. Class notes. Math 322. University Wisconsin, Madison. Fall 2016. By Professor Smith. Math dept.
  5. various pages Wikipedia.
  6. good note on Sturm-Liouville http://ramanujan.math.trinity.edu/rdaileda/teach/s12/m3357/lectures/lecture_4_10_short.pdf