The Finite Element Method (FEM) is arguably the most powerful method known for the numerical solution of boundary- and initial-value problems characterized by partial differential equations.
Consequently, it has had a monumental impact on virtually all areas of engineering and applied science.
There are two fundamental attributes of the method that are at the heart of its great utility and success.
Firstly, it is based on the idea of partitioning bounded domains in into a number of small, non-overlapping subdomains, the finite elements, over which functions are approximated by local functions, generally polynomials.
Secondly, the boundary- and initial-value problems, to which the method is applied, are formulated in a so-called weak, or integral form, so that the contributions of each subdomain to the global integrals sum up to produce an integral characterizing the problem over the whole domain.
Some point to the first attribute of FEMs as its most important: the systemic representation of approximations of function spaces. From the viewpoint of approximation theory, the method provides a systematic approach to piecewise approximation over subdomains that produces sequences of functions that can approximate arbitrary members of, say, Sobolev spaces, arbitrarily closely in appropriate norms (e.g. Sobolev norms). This attribute is referred to as a property of "interpolation" for FEMs. It basically lifts the idea of approximate function representation from the classical notion of interpolation across values of functions or values of derivatives to a general approach for approximating functions with generalized derivatives in spaces such as or
But, were it only its ability to represent functions on quite general domains in the FEM would not have attained its popularity in science and engineering.
It is the powerful notion of weak formulations of boundary- and initial-value problems that
enables the summation of element contributions to integrals over the whole domain, a property
that allows one to conceptualize a formulation of the problem at hand over a simple domain,
the finite element, so as to approximate it locally. Then, these local subdomain approximations,
as noted earlier, are summed up to determine the contributions from the full domain. This is the
essence of the FEM.
The basic idea can be demonstrated by a simple model problem. Consider the Poisson problem on a two-dimensional domain which involves seeking a real-valued function such that
where is the Laplacian, is given source data, and is an open bounded domain in with boundary Formally, the differential equation in (1) is multiplied by a test function integrated by parts via Green's theorem, to obtain
where is the outward unit vector normal to the boundary If now one demands that on (1) is formally equivalent to the following weak or integral form of the boundary-value problem:
Find such that
Here is called the space of trial functions and the space of test functions. Conceptually, the formulation (2) is "weaker" than (1) as it imposes weaker conditions on the smoothness of the solutions and test functions In fact
and, in this case, The space is usually denoted
where is the Sobolev space of functions on with generalized derivatives a special case of the of functions with derivatives of orders in
Problem (2) is a simple example of a broad class of linear boundary- and initial-value problems of the form
where is a bilinear form mapping (reflexive Banach spaces) into and is a continuous linear functional defined on
The standard finite element approximation of (2) amounts to the construction of Galerkin or weighted residual approximations of (1) over finite dimensional subspaces
and of and respectively. That is, one seeks
with being a discretization parameter generally chosen as a measure of the mesh size. The great utility of FEM's arises from its approach to constructing the approximation spaces and for functions defined on quite general domains
Returning to the example (1) (or (2)), consider the case in which is the domain shown in Figure 1, partitioned with a mesh of triangular elements, as shown. This mesh is viewed as one generated by a sequence of maps of a master triangle into the triangles so that, when connected along their edges and at their vertices, cover or produce a close approximation of Suppose there are elements in the mesh. Over a typical isolated triangle assume, for example, that the restriction of the test function varies linearly. Then,
Figure 1: A triangular finite element partition of a domain into elements overwhich test functions are piecewise linear functions of the coordinates
The constants and are determined so that is completely determined by its values at the vertices of called the element nodes. If and denote the values of at nodes numbered 1, 2, and 3, then one easily determines that can be written,
where the are the element shape functions,
are the coordinates of node and is the area of These linear functions have the property that
All the elements are then connected together to form the full computational domain (or an approximation of it), and in so doing create global piecewise linear basis functions corresponding to each node in the connected mesh. These "pyramid" or "hat" functions result from patching the element shape functions together, as indicated in Figure 2. Now the (global) test functions are linear combinations of these basis functions: For a mesh with interior nodes (boundary nodes are excluded since on the boundary),
Figure 2: A global basis function for the spaces and made up by piecing together element shape functions. Note that the basis function is defined everywhere in but is nonzero only over the patch of elements surrounding node
In the finite element approximation of (2), one seeks
that satisfies the weak problem (4) where
for arbitrary coefficients Here are the elements of the global stiffness matrix:
and are the components of the load vector,
This leads to a system of linear equations for the coefficients called the degrees of freedom:
The final solution is then,
A property of fundamental importance is that this entire process can essentially be done at the local element level, and then summed up to apply to the global version of the problem. Thus, on element the local equalities for element are
where and are the elements of the local stiffness matrix and of the local load vector for element respectively, and are element interface flux terms:
In the expression for is the exact solution of the boundary-value problem (2); while it is unknown, it does not enter the discrete global problem because
In practice, the local element matrices (6) are integrated using numerical quadrature methods when higher-order shape functions are used. In general, FEM computer programs are written so that element matrices such as those in (6) are calculated from a family of element types, and these are used to generate the global matrices of (5) when the elements are connected together to form the global element mesh. Of course, there are many other possible choices for finite elements, including for instance, quadratic approximations on triangles with 6 nodes, cubic with 10 nodes, quadrilateral elements, etc., and generalizations to tetrahedral and hexahedral elements for three-dimensional elements are commonly used.
The mathematical properties of FEM's and the fact that convergent and accurate approximations of solutions to problems defined on domains of quite general geometry are worth noting. Returning to the general weak boundary-value problem (3), one observes that is typically assumed to be continuous on into with continuity constant and to satisfy an "inf-sup" condition
where is a constant. Under the conditions that and are continuous (on and respectively), that for all and that (7) holds, it can be shown that (3) possesses a unique solution.
To determine the FE approximation, the domain is partitioned into subdomains such that
As noted earlier, each element is regarded as the image of a master subdomain (a master element) under an invertible element map In this way, a sequence of subspaces is constructed such that
where is the space of local FEM approximations, e.g.
or
Then the FEM approximation of (3) reduces to the sequence of discrete problems (4). From (3) and (4), one can derive the standard "orthogonality condition"
So, in this sense, the FEM (Galerkin) approximation of (4) is
optimal: the error is "orthogonal" to the space of approximations with respect to the bilinear form
Under reasonable assumptions on the sequence of meshes producing the subspaces one can approximate functions in with interpolation error satisfying
where is the highest order of complete polynomials contained in the space of local FEM approximations (as in (8)), is a projection operator of into that preserves polynomials of degree for triangular elements, and is a constant.
When interpolation properties such as this hold, one can typically obtain a priori error estimates for elliptic boundary-value problems of the form
where now and Thus, the rate of convergence of such methods in the -norm is
The constant is independent of and but depends upon the continuity constant of and the constant of (7).
Many variations of such results can be obtained. Among them are cases in which the local spectral order is an approximation parameter. These characterize the "-versions" of the FEM. Likewise, both and can be varied, resulting in - FEMs. The "-versions" of the FEM commonly refers to the case where is uniform over all elements.
The field of FEM's, both as a tool for solving problems of science and engineering
and as an area of research in numerical mathematics, is large and expanding and
continues to be a vital area of study and practical application.
[edit] References