Variational numerical methods

(for differential equations)

Among the principles that appear in physics there are such which are not expressed in terms of differentials but in variational form. These principles are called variational principles and they describe the conditions under which certain values attain extreme (i.e. maximum or minimum) values. Examples of these principles are Hamilton's principle, the principle of least action in classical mechanics (cf. also Principle of least reaction) and the Fermat principle in geometric optics; other examples can be found in mathematical physics, structural mechanics, fluid dynamics, heat transfer, etc.

The calculus of variations (cf. Variational calculus) is concerned with the problem of extreme values of functionals, which are real- or complex-valued functions defined on a subset of a function space. This subset consists of functions defined on a certain domain. An element of such a subset is called an admissible function (or argument function). General attention was first drawn to problems of this type in 1696 by Johann Bernoulli's statement of the brachistochrone problem (cf. Brachistochrone).

The fundamental criterion in the calculus of variations is as follows: A necessary and sufficient condition that the integral

$$ J( v) = \int\limits _ {x _ {0} } ^ {x _ {1} } F( x, v, v _ {x} ) dx, $$

with

$$ v _ {x} = \frac{dv }{dx } , $$

be stationary when $ v = u $ is that $ v = u $ be an admissible function satisfying the Euler equation

$$ \frac{\partial F }{\partial v } - \frac{d }{dx } \left ( \frac{\partial F }{\partial v _ {x} } \right ) = 0. $$

Its solutions are called extremals and to solve the problem one has, among all the extremals, to find one which satisfies prescribed boundary conditions.
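For example, with $ F( x, v, v _ {x} ) = \frac{1}{2} v _ {x} ^ {2} - fv $ for a given function $ f $, one has $ \partial F / \partial v = - f $ and $ \partial F / \partial v _ {x} = v _ {x} $, so the Euler equation reduces to the linear two-point boundary value problem

$$ - \frac{d ^ {2} u }{dx ^ {2} } = f. $$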

The problem of finding the extreme values of an integral can be extended to the case of several independent variables and multiple integrals of the type

$$ J( v) = \int\limits F( x _ {1} , \dots, x _ {n} , v, v _ {x _ {1} } , \dots, v _ {x _ {n} } ) d \Omega , $$

with

$$ v _ {x _ {i} } = \frac{\partial v }{\partial x _ {i} } ,\ \ d \Omega = dx _ {1} \dots dx _ {n} . $$

The calculus of variations discusses the correspondence between the functional $ J( v) $ to be extremized and the Euler equation for the solution $ u $. A solution of an extremal problem can thus be studied by looking at the corresponding Euler equation. Conversely, when a differential equation is given, a functional whose Euler equation is the given differential equation can sometimes be constructed. In this case the extremal problem and the Euler equation are in some sense equivalent. To solve Euler's equation, boundary conditions (for instance, of Dirichlet- or Neumann-type) have to be prescribed. Those boundary conditions which are imposed on the admissible functions are called essential (e.g. Dirichlet-type), those which are automatically satisfied by the solution of the extremal problem and which are not explicitly imposed on the set of admissible functions are called natural (e.g. Neumann-type).

As an example, let $ \Omega $ be an open bounded domain in the $ n $-dimensional space $ \mathbf R ^ {n} $ and let $ f \in L _ {2} ( \Omega ) $ be a real-valued function. Consider the problem of minimizing the functional

$$ J( v) = \frac{1 }{2} \int\limits _ \Omega | \mathop{\rm grad} v | ^ {2} d \Omega - \int\limits _ \Omega fv d \Omega . $$

The set of admissible functions is the Hilbert space obtained by completing the space of infinitely-differentiable functions with compact support in $ \Omega $ with respect to the Sobolev norm

$$ \| v \| _ {1} = \left \{ \int\limits _ \Omega | \mathop{\rm grad} v | ^ {2} d \Omega + \int\limits _ \Omega v ^ {2} d \Omega \right \} ^ {1/2 } . $$

It can be shown, using the Riesz representation theorem (cf. Riesz theorem), that the extremal problem is equivalent (in the generalized, distributional sense) to the following Euler equation for $ u $ (i.e. the Poisson equation with Dirichlet boundary conditions):

$$ - \mathop{\rm div} \mathop{\rm grad} u = f \ \mathop{\rm in} \Omega , $$

$$ u = 0 \ \mathop{\rm on} \partial \Omega = \textrm{ the boundary of } \Omega . $$
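This equivalence can be seen by computing the first variation: for every admissible $ \phi $ and real $ \epsilon $, the minimality of $ u $ forces

$$ \left . \frac{d }{d \epsilon } J( u + \epsilon \phi ) \right | _ {\epsilon = 0 } = \int\limits _ \Omega \mathop{\rm grad} u \cdot \mathop{\rm grad} \phi \, d \Omega - \int\limits _ \Omega f \phi \, d \Omega = 0, $$

and an integration by parts (Green's formula) turns this condition into the Poisson equation in the distributional sense.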

There are some advantages in solving the extremal formulation (if it exists) instead of the differential equation: some boundary conditions (the natural ones) are automatically fulfilled, and the differentiability requirements on the admissible functions are lower.

Consider the problem of minimizing the functional $ J $ over a function space $ V $, and let $ \phi _ {1} , \phi _ {2} , \dots $ be a basis for $ V $. Let $ V _ {N} $ be the finite-dimensional approximation of $ V $ spanned by the first $ N $ basis functions $ \phi _ {1} , \dots, \phi _ {N} $. The approximate problem is defined as: Find $ u _ {N} \in V _ {N} $ minimizing the functional $ J $ on $ V _ {N} $.

Since $ u _ {N} $ can be written as

$$ u _ {N} = \sum _ {i= 1 } ^ { N } \alpha _ {i} \phi _ {i} , $$

with unknown $ \alpha _ {1} , \dots, \alpha _ {N} $, the approximate problem reduces to: Find $ \{ \alpha _ {1} , \dots, \alpha _ {N} \} $ such that $ J( \alpha _ {1} , \dots, \alpha _ {N} ) $ takes its extremum value.

This leads to the following $ N $ (necessary) conditions for the $ N $ unknowns $ \alpha _ {1} , \dots, \alpha _ {N} $:

$$ \frac{\partial J( \alpha _ {1} , \dots, \alpha _ {N} ) }{\partial \alpha _ {i} } = 0,\ i = 1, \dots, N. $$

The original extremal problem has been reduced to a system of $ N $ equations for $ N $ unknowns, which can be solved numerically. The accuracy of the approximate solution $ u _ {N} $ depends on the number $ N $ of basis functions. Generally, as $ N $ increases the accuracy improves, i.e. $ u _ {N} $ converges to $ u $.

The method that constructs a sequence $ u _ {1} , u _ {2} , \dots $ by determining the values of $ \alpha _ {i} $, $ i = 1, 2, \dots, $ is called the Ritz method. Inclusion of $ V _ {m} $ in $ V _ {m+ 1 } $ is not essential; for convergence the approximate spaces must satisfy the following density property: $ \cup _ {m=1} ^ \infty V _ {m} $ is dense in $ V $.
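As a minimal illustration of the Ritz method, consider a sketch under assumptions made here for concreteness (the one-dimensional model functional $ J( v) = \frac{1}{2} \int _ {0} ^ {1} v _ {x} ^ {2} dx - \int _ {0} ^ {1} fv dx $ with $ v( 0) = v( 1) = 0 $, the sine basis $ \phi _ {i} ( x) = \sin ( i \pi x) $ and a midpoint quadrature rule). The conditions $ \partial J / \partial \alpha _ {i} = 0 $ become the linear system $ \sum _ {j} \alpha _ {j} \int \phi _ {i} ^ \prime \phi _ {j} ^ \prime dx = \int f \phi _ {i} dx $:

```python
import numpy as np

# Ritz method sketch for J(v) = 1/2*int_0^1 (v')^2 dx - int_0^1 f*v dx,
# v(0) = v(1) = 0, with basis phi_i(x) = sin(i*pi*x), i = 1, ..., N.
# Basis, right-hand side f and quadrature are illustrative choices.
N = 8                                   # number of basis functions
M = 2000                                # quadrature points
x = (np.arange(M) + 0.5) / M            # midpoint rule nodes on (0, 1)
w = 1.0 / M                             # midpoint rule weight
f = x * (1.0 - x)                       # example right-hand side

i = np.arange(1, N + 1)[:, None]        # mode numbers as a column vector
phi = np.sin(i * np.pi * x)             # phi[i] sampled at the nodes
dphi = i * np.pi * np.cos(i * np.pi * x)

# dJ/dalpha_i = 0  <=>  A @ alpha = b, with
# A[i, j] = int phi_i' phi_j' dx and b[i] = int f phi_i dx.
# (For this orthogonal basis A happens to be diagonal; the assembly
# is written generally so any basis could be substituted.)
A = (dphi @ dphi.T) * w
b = (phi @ f) * w
alpha = np.linalg.solve(A, b)           # Ritz coefficients
u_N = alpha @ phi                       # approximate minimizer at the nodes
```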

The Galerkin method, which can be considered as a generalization of the Ritz method, is directly applicable to the differential equation, irrespective of the existence of an equivalent extremal formulation. Galerkin's method can be illustrated as follows: Consider the following linear second-order differential equation with (essential) Dirichlet boundary conditions (a boundary value problem):

$$ {\mathcal L} u = f \ \mathop{\rm in} \Omega \subset \mathbf R ^ {n} , $$

$$ u = 0 \ \mathop{\rm on} \partial \Omega . $$

The solution is sought in a function space $ V $ which is supposed to have a basis $ \{ \phi _ {1} , \phi _ {2} , \dots \} $. Any element $ w \in V $ can be written as an infinite linear combination of these basis functions. For arbitrary $ w \in V $, the expression $ {\mathcal L} w - f = R( w) $ is called the residual. The problem is to find an element $ u \in V $ for which the residual vanishes. This amounts to saying that the solution $ u $ is characterized by the following projection requirements:

$$ \int\limits _ \Omega ( {\mathcal L} u - f ) \phi _ {i} d \Omega = 0,\ \ i = 1, 2,\dots. $$

Reduction of the order of differentiation by Green's formula and substitution of the boundary conditions leads to the following variational (or weak) formulation of the boundary value problem: Find $ u \in V $ such that

$$ a( u, \phi _ {i} ) = \int\limits _ \Omega f \phi _ {i} d \Omega , \ i = 1, 2, \dots, $$

where $ a( u, \phi ) $ is a bilinear form on $ V \times V $. When $ {\mathcal L} $ is, for instance, the Laplace operator, $ a( u, \phi ) $ is given by

$$ a( u, \phi ) = \int\limits _ \Omega \mathop{\rm grad} u \cdot \mathop{\rm grad} \phi d \Omega . $$

The Galerkin method consists of taking a finite-dimensional subspace $ V _ {N} $ of $ V $ spanned by $ N $ basis functions $ \phi _ {1} , \dots, \phi _ {N} $. The approximate finite-dimensional problem is defined as: Find $ u _ {N} \in V _ {N} $ such that

$$ a( u _ {N} , \phi _ {i} ) = \int\limits _ \Omega f \phi _ {i} d \Omega , \ i = 1, \dots, N. $$

Since $ u _ {N} \in V _ {N} $, it has the form

$$ u _ {N} = \sum_{j=1} ^ { N } \alpha _ {j} \phi _ {j} , $$

and the approximate problem is equivalent to the following system of $ N $ linear algebraic equations for the unknowns $ \alpha _ {1} , \dots, \alpha _ {N} $:

$$ \sum _ {j=1} ^ { N } \alpha _ {j} a( \phi _ {i} , \phi _ {j} ) = \int\limits _ \Omega f \phi _ {i} d \Omega ,\ i = 1, \dots, N. $$

This system can be solved by Gaussian elimination or by iteration.

When the coefficients of the differential operator $ {\mathcal L} $ and the function $ f $ are periodic in space and when sine or cosine basis functions are chosen, the Galerkin method is equivalent to the Fourier method and can be efficiently implemented using a fast Fourier transform (cf. Fourier transform, discrete). The Galerkin method can also be applied to the numerical solution of evolution equations, leading to a system of ordinary differential equations in time which can be solved numerically by integration techniques.
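A concrete sketch of this equivalence, under assumptions made here for illustration (the periodic model problem $ - u ^ {\prime\prime} = f $ on $ ( 0, 2 \pi ) $ with a mean-zero right-hand side): with the trigonometric basis the Galerkin equations decouple into $ k ^ {2} \widehat{u} _ {k} = \widehat{f} _ {k} $ per wave number $ k $, which can be solved mode-by-mode after a fast Fourier transform.

```python
import numpy as np

# Fourier-Galerkin sketch for -u'' = f, periodic on (0, 2*pi).
# With the trigonometric basis the Galerkin equations decouple into
# k^2 * u_hat[k] = f_hat[k] for each wave number k != 0.
# The right-hand side f below is an illustrative, mean-zero choice.
n = 256
x = 2.0 * np.pi * np.arange(n) / n
f = np.cos(3 * x)                       # exact solution: u(x) = cos(3x)/9

k = np.fft.fftfreq(n, d=1.0 / n)        # integer wave numbers 0, 1, ..., -1
f_hat = np.fft.fft(f)

u_hat = np.zeros_like(f_hat)
nz = k != 0
u_hat[nz] = f_hat[nz] / k[nz] ** 2      # divide mode-by-mode
u = np.fft.ifft(u_hat).real             # approximate solution (mean zero)
```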

Both the Ritz method and the Galerkin method transform the infinite-dimensional problem into a (finite-dimensional) system of linear equations. The finite-element method is a systematic technique for generating basis functions on arbitrary domains $ \Omega $ which gives rise to a sparse matrix $ A $ (i.e. a matrix with many zero entries). The finite-element method consists of three steps:

The first step is to divide the domain $ \overline{\Omega} = \Omega \cup \partial \Omega \subset \mathbf R ^ {n} $ into a finite number of non-overlapping subregions (elements) the length of which does not exceed $ h > 0 $. Standard examples of elements are subintervals ($ n = 1 $), triangles in $ 2 $ dimensions and tetrahedra in $ 3 $ dimensions.

In the second step, a finite number of points (called nodal points) are chosen in each element. Examples are: begin- and end-point of the subintervals; the vertices of the triangles; or the vertices of the tetrahedra. More complicated examples are begin-, end- and mid-points of subintervals, or vertices and mid-points of sides of triangles. The (finite) set of all nodal points will be denoted by $ {\mathcal P} = \{ P _ {1} , \dots, P _ {N} \} $.

In the third step, a continuous basis function $ \phi _ {i} $ is defined on $ \overline{\Omega} $ for each nodal point $ P _ {i} \in {\mathcal P} $. These basis functions satisfy the following properties: 1) $ \phi _ {i} $ takes the value $ 1 $ at nodal point $ P _ {i} $ and vanishes at all the other nodal points; and 2) $ \phi _ {i} $ has a prescribed shape on each element (depending on the number of nodal points per element). For example, in the $ n $-dimensional case: linear basis functions with $ n + 1 $ nodal points per element or quadratic basis functions with $ ( n+ 1)( n+ 2)/2 $ nodal points per element.
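In one dimension with nodal points $ x _ {1} < \dots < x _ {N} $, for instance, the linear basis function attached to $ x _ {i} $ is the "hat function"

$$ \phi _ {i} ( x) = \left \{ \begin{array}{ll} ( x - x _ {i-1} ) / ( x _ {i} - x _ {i-1} ) & \textrm{ for } x _ {i-1} \leq x \leq x _ {i} , \\ ( x _ {i+1} - x ) / ( x _ {i+1} - x _ {i} ) & \textrm{ for } x _ {i} \leq x \leq x _ {i+1} , \\ 0 & \textrm{ otherwise } , \end{array} \right . $$

which equals $ 1 $ at $ x _ {i} $, vanishes at all other nodal points and is linear on each element.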

Using these basis functions an approximate solution $ u _ {N} $ can be constructed. In fact, the function

$$ u _ {N} ( x) = \sum _ {j= 1 } ^ { N } \widetilde{u} _ {j} \phi _ {j} ( x) $$

is a linear combination of the basis functions $ \phi _ {j} $, continuous on $ \overline{\Omega} $, and with prescribed behaviour on the elements. Due to the particular construction of the basis functions, the parameter $ \widetilde{u} _ {j} $ is precisely the value of $ u _ {N} $ at nodal point $ P _ {j} $.

The resulting system of linear equations can be written as

$$ \sum _ {j=1} ^ { N } \widetilde{u} _ {j} a( \phi _ {i} , \phi _ {j} ) = \int\limits _ \Omega f \phi _ {i} d \Omega ,\ i = 1, \dots, N, $$

where the matrix entry $ a( \phi _ {i} , \phi _ {j} ) $ differs from zero only if the nodal points $ P _ {i} $ and $ P _ {j} $ belong to the same element. This implies that the matrix (often called the stiffness matrix) is sparse.
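A minimal sketch of this assembly, under assumptions made here for illustration (one dimension, a uniform mesh, homogeneous Dirichlet conditions and the model problem $ - u ^ {\prime\prime} = f $): the entries $ a( \phi _ {i} , \phi _ {j} ) = \int \phi _ {i} ^ \prime \phi _ {j} ^ \prime dx $ vanish unless $ | i - j | \leq 1 $, so the stiffness matrix is tridiagonal.

```python
import numpy as np

# Linear finite elements for -u'' = f on (0, 1), u(0) = u(1) = 0,
# on a uniform mesh with N interior nodal points and mesh width h.
# a(phi_i, phi_j) = int phi_i' phi_j' dx vanishes unless |i - j| <= 1.
N = 9
h = 1.0 / (N + 1)
nodes = h * np.arange(1, N + 1)         # interior nodal points

A = np.zeros((N, N))                    # stiffness matrix (sparse: tridiagonal)
for i in range(N):
    A[i, i] = 2.0 / h                   # int (phi_i')^2 over both adjacent elements
    if i + 1 < N:
        A[i, i + 1] = A[i + 1, i] = -1.0 / h  # overlap of neighbouring hats

f = np.ones(N)                          # example right-hand side f = 1
b = h * f                               # int f phi_i dx = h for f = 1

u = np.linalg.solve(A, b)               # nodal values u~_j of u_N
# For f = 1 the nodal values agree with the exact solution x*(1 - x)/2.
```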

The convergence of $ u _ {N} $ to $ u $ can be proved if the set of elements satisfies certain regularity properties. To improve the accuracy, polynomials of higher degree per element are used. Curved boundaries can be taken into account by using isoparametric elements.

Generalizations and extensions of the finite-element method exist to problems of higher order (for instance, mixed finite elements for the biharmonic equation, cf. Biharmonic function), to problems with specific difficulties (for instance, non-conforming finite elements for the Navier–Stokes equations for incompressible fluids and mixed-hybrid finite elements in stress analysis, electrostatics and ground water hydraulics) or to evolution problems (mass lumping and consistent mass schemes).
