Difference between revisions of "Butcher series"

Revision as of 06:29, 30 May 2020

B-series

Let $ y : \mathbf R \rightarrow {\mathbf R ^ {d} } $ be an analytic function satisfying an ordinary differential equation $ y ^ \prime ( x ) = f ( y ( x ) ) $. A Butcher series for $ y $ is a series of the form

$$ B ( a,y ( x ) ) = \sum _ {t \in T } a ( t ) F ( t ) ( y ( x ) ) { \frac{h ^ {\rho ( t ) } }{\rho ( t ) ! } } , $$

where $ T $ is the set of rooted trees, $ a ( t ) \in \mathbf R $ is the coefficient for the series associated with the tree $ t $, $ F ( t ) ( y ( x ) ) \in \mathbf R ^ {d} $ is the elementary differential associated with the tree $ t $, $ h \in \mathbf R $ is a stepsize or $ x $- increment, and $ \rho ( t ) $, often called the order of the tree $ t $, is the number of nodes in the tree $ t $. All terms will be defined below. For a more thorough discussion of Butcher series, see [a1] or [a2].

Definitions

A tree is a connected acyclic graph (cf. also Graph). A rooted tree is a tree with a distinguished node, called the root (which is drawn at the bottom of the sample trees in the Table). Two rooted trees $ t $ and $ {\widehat{t} } $ are isomorphic (i.e., equivalent) if and only if there is a bijection $ \sigma $ from the nodes of $ t $ to the nodes of $ {\widehat{t} } $ such that $ \sigma $ maps the root of $ t $ to the root of $ {\widehat{t} } $ and any two nodes $ n _ {1} $ and $ n _ {2} $ are connected by an edge in $ t $ if and only if $ \sigma ( n _ {1} ) $ and $ \sigma ( n _ {2} ) $ are connected by an edge in $ {\widehat{t} } $.

The following bracket notation for rooted trees is very useful. Let $ \emptyset $ and $ \bullet $ be the trees with zero and one node, respectively. For rooted trees $ t _ {1} \dots t _ {m} $ with $ \rho ( t _ {i} ) \geq 1 $ for $ i = 1 \dots m $, let $ t = [ t _ {1} \dots t _ {m} ] $ be the rooted tree formed by connecting a new root to the roots of each of $ t _ {1} \dots t _ {m} $. Clearly, $ \rho ( t ) = 1 + \sum _ {i = 1 } ^ {m} \rho ( t _ {i} ) $ and any rooted tree formed by permuting the subtrees $ t _ {1} \dots t _ {m} $ of $ t $ is isomorphic to $ t $. The rooted trees with at most four nodes are displayed in the table below.

$ t _ {0} = $	$ \emptyset $
$ t _ {1} = $
$ t _ {2} = [ t _ {1} ] = $
$ t _ {3} = [ t _ {1} ,t _ {1} ] = $
$ t _ {4} = [ t _ {2} ] = $
$ t _ {5} = [ t _ {1} ,t _ {1} ,t _ {1} ] = $
$ t _ {6} = [ t _ {1} ,t _ {2} ] = $
$ t _ {7} = [ t _ {3} ] = $
$ t _ {8} = [ t _ {4} ] = $

Using this bracket notation for rooted trees, one can recursively define elementary differentials, as follows. Let

$$ F ( \emptyset ) ( y ( x ) ) = y ( x ) , \quad F ( \bullet ) ( y ( x ) ) = f ( y ( x ) ) , $$

and, for $ t = [ t _ {1} \dots t _ {m} ] $ with $ \rho ( t _ {i} ) \geq 1 $ for $ i = 1 \dots m $,

$$ F ( t ) ( y ( x ) ) = $$

$$ = f ^ {( m ) } ( y ( x ) ) ( F ( t _ {1} ) ( y ( x ) ) \dots F ( t _ {m} ) ( y ( x ) ) ) , $$

where $ f ^ {( m ) } ( y ( x ) ) $, the $ m $ th derivative of $ f $ with respect to $ y $, is a multi-linear mapping.

Since $ y ( x ) $ satisfies $ y ^ \prime ( x ) = f ( y ( x ) ) $, it is not hard to show that

$$ y ^ {( i ) } ( x ) = \sum _ {t \in T _ {i} } \alpha ( t ) F ( t ) ( y ( x ) ) , $$

where $ y ^ {( i ) } ( x ) $ is the $ i $ th derivative of $ y $ with respect to $ x $, $ T _ {i} = \{ {t \in T } : {\rho ( t ) = i } \} $, and $ \alpha ( t ) $ is the number of distinct ways of labeling the nodes of $ t $ with the integers $ \{ 1 \dots \rho ( t ) \} $ such that the labels increase as you follow any path from the root to a leaf of $ t $. Consequently, one can write the Taylor series for $ y ( x ) $ as a Butcher series:

$$ \tag{a1 } y ( x + h ) = \sum _ {i = 0 } ^ \infty y ^ {( i ) } ( x ) { \frac{h ^ {i} }{i! } } = $$

$$ = \sum _ {t \in T } \alpha ( t ) F ( t ) ( y ( x ) ) { \frac{h ^ {\rho ( t ) } }{\rho ( t ) ! } } . $$

The importance of Butcher series stems from their use to derive and to analyze numerical methods for differential equations. For example, consider the $ s $- stage Runge–Kutta formula (cf. Runge–Kutta method)

$$ \tag{a2 } y _ {n + 1 } = y _ {n} + h \sum _ {i = 1 } ^ { s } b _ {i} k _ {i} , $$

$$ k _ {i} = f \left ( y _ {n} + h \sum _ {j = 1 } ^ { s } a _ {ij } k _ {j} \right ) , $$

for the ordinary differential equation $ y ^ \prime ( x ) = f ( y ( x ) ) $. Let $ b = ( b _ {1} \dots b _ {s} ) ^ {T} \in \mathbf R ^ {s} $ be the vector of weights and $ A = [ a _ {ij } ] \in \mathbf R ^ {s \times s } $ the coefficient matrix for (a2). Then it can be shown that $ y _ {n + 1 } $ can be written as the Butcher series

$$ \tag{a3 } y _ {n + 1 } = \sum _ {t \in T } \alpha ( t ) \psi ( t ) F ( t ) ( y _ {n} ) { \frac{h ^ {\rho ( t ) } }{\rho ( t ) ! } } , $$

where the coefficients $ \psi ( t ) \in \mathbf R $, defined below, depend only on the rooted tree $ t $ and the coefficients of the formula (a2).

To define $ \psi ( t ) $, first let

$$ \phi ( \emptyset ) = ( 1 \dots 1 ) ^ {T} \in \mathbf R ^ {s} , $$

$$ \phi ( \bullet ) = A \phi ( \emptyset ) , $$

and then, for $ t = [ t _ {1} \dots t _ {m} ] $ with $ \rho ( t _ {i} ) \geq 1 $ for $ i = 1 \dots m $, define $ \phi ( t ) $ recursively by

$$ \phi ( t ) = \rho ( t ) A ( \phi ( t _ {1} ) \dots \phi ( t _ {m} ) ) , $$

where the product of $ \phi $' s in the bracket is taken component-wise. Then

$$ \psi ( \emptyset ) = 1, $$

$$ \psi ( \bullet ) = \sum _ {i = 1 } ^ { s } b _ {i} , $$

and, for $ t = [ t _ {1} \dots t _ {m} ] $ with $ \rho ( t _ {i} ) \geq 1 $ for $ i = 1 \dots m $,

$$ \psi ( t ) = \rho ( t ) b ^ {T} ( \phi ( t _ {1} ) \dots \phi ( t _ {m} ) ) , $$

where, again, the product of $ \phi $' s in the bracket is taken componentwise.

Assuming that $ y ( x ) = y _ {n} $ and that $ \psi ( t ) = 1 $ for all trees $ t $ of order $ \leq p $, it follows immediately from (a1) and (a3) that

$$ y _ {n + 1 } = y ( x + h ) + {\mathcal O} ( h ^ {p + 1 } ) . $$

The order of (a2) is the largest such $ p $.

References

[a1]	J.C. Butcher, "The numerical analysis of ordinary differential equations" , Wiley (1987)
[a2]	E. Hairer, S.P. Nørsett, G. Wanner, "Solving ordinary differential equations I: nonstiff problems" , Springer (1987)

How to Cite This Entry:
Butcher series. Encyclopedia of Mathematics. URL: http://encyclopediaofmath.org/index.php?title=Butcher_series&oldid=46177

This article was adapted from an original article by K.R. Jackson (originator), which appeared in Encyclopedia of Mathematics - ISBN 1402006098. See original article

Navigation

Tools

Namespaces

Variants

Views

Actions

Difference between revisions of "Butcher series"

Revision as of 06:29, 30 May 2020

Definitions

References

@@ Line 1: / Line 1: @@
+<!--
+b1110701.png
+$#A+1 = 106 n = 0
+$#C+1 = 106 : ~/encyclopedia/old_files/data/B111/B.1101070 Butcher series,
+Automatically converted into TeX, above some diagnostics.
+Please remove this comment and the {{TEX|auto}} line below,
+if TeX found to be correct.
+-->
+{{TEX|auto}}
+{{TEX|done}}
 ''B-series''
-Let <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b1110701.png" /> be an [[Analytic function|analytic function]] satisfying an ordinary differential equation <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b1110702.png" />. A Butcher series for <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b1110703.png" /> is a series of the form
+Let  $  y : \mathbf R \rightarrow {\mathbf R  ^ {d} } $
+be an [[Analytic function|analytic function]] satisfying an ordinary differential equation  $  y  ^  \prime  ( x ) = f ( y ( x ) ) $.
+A Butcher series for  $  y $
+is a series of the form
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b1110704.png" /></td> </tr></table>
+$$
+B ( a,y ( x ) ) = \sum _ {t \in T } a ( t ) F ( t ) ( y ( x ) ) {
+\frac{h ^ {\rho ( t ) } }{\rho ( t ) ! }
+ } ,
+$$
-where <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b1110705.png" /> is the set of rooted trees, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b1110706.png" /> is the coefficient for the series associated with the tree <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b1110707.png" />, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b1110708.png" /> is the elementary differential associated with the tree <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b1110709.png" />, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107010.png" /> is a stepsize or <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107011.png" />-increment, and <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107012.png" />, often called the order of the tree <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107013.png" />, is the number of nodes in the tree <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107014.png" />. All terms will be defined below. For a more thorough discussion of Butcher series, see [[#References|[a1]]] or [[#References|[a2]]].
+where  $  T $
+is the set of rooted trees,  $  a ( t ) \in \mathbf R $
+is the coefficient for the series associated with the tree  $  t $,
+$  F ( t ) ( y ( x ) ) \in \mathbf R  ^ {d} $
+is the elementary differential associated with the tree  $  t $,
+$  h \in \mathbf R $
+is a stepsize or  $  x $-
+increment, and  $  \rho ( t ) $,
+often called the order of the tree  $  t $,
+is the number of nodes in the tree  $  t $.
+All terms will be defined below. For a more thorough discussion of Butcher series, see [[#References|[a1]]] or [[#References|[a2]]].
 ==Definitions==
-A tree is a connected acyclic graph (cf. also [[Graph|Graph]]). A rooted tree is a tree with a distinguished node, called the root (which is drawn at the bottom of the sample trees in the Table). Two rooted trees <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107015.png" /> and <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107016.png" /> are isomorphic (i.e., equivalent) if and only if there is a [[Bijection|bijection]] <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107017.png" /> from the nodes of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107018.png" /> to the nodes of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107019.png" /> such that <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107020.png" /> maps the root of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107021.png" /> to the root of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107022.png" /> and any two nodes <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107023.png" /> and <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107024.png" /> are connected by an edge in <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107025.png" /> if and only if <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107026.png" /> and <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107027.png" /> are connected by an edge in <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107028.png" />.
+A tree is a connected acyclic graph (cf. also [[Graph|Graph]]). A rooted tree is a tree with a distinguished node, called the root (which is drawn at the bottom of the sample trees in the Table). Two rooted trees  $  t $
+and  $  {\widehat{t}  } $
+are isomorphic (i.e., equivalent) if and only if there is a [[Bijection|bijection]]  $  \sigma $
+from the nodes of  $  t $
+to the nodes of  $  {\widehat{t}  } $
+such that  $  \sigma $
+maps the root of  $  t $
+to the root of  $  {\widehat{t}  } $
+and any two nodes  $  n _ {1} $
+and  $  n _ {2} $
+are connected by an edge in  $  t $
+if and only if  $  \sigma ( n _ {1} ) $
+and  $  \sigma ( n _ {2} ) $
+are connected by an edge in  $  {\widehat{t}  } $.
-The following bracket notation for rooted trees is very useful. Let <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107029.png" /> and <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107030.png" /> be the trees with zero and one node, respectively. For rooted trees <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107031.png" /> with <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107032.png" /> for <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107033.png" />, let <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107034.png" /> be the rooted tree formed by connecting a new root to the roots of each of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107035.png" />. Clearly, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107036.png" /> and any rooted tree formed by permuting the subtrees <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107037.png" /> of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107038.png" /> is isomorphic to <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107039.png" />. The rooted trees with at most four nodes are displayed in the table below.
+The following bracket notation for rooted trees is very useful. Let  $  \emptyset $
+and  $  \bullet $
+be the trees with zero and one node, respectively. For rooted trees  $  t _ {1} \dots t _ {m} $
+with  $  \rho ( t _ {i} ) \geq  1 $
+for  $  i = 1 \dots m $,
+let  $  t = [ t _ {1} \dots t _ {m} ] $
+be the rooted tree formed by connecting a new root to the roots of each of  $  t _ {1} \dots t _ {m} $.
+Clearly,  $  \rho ( t ) = 1 + \sum _ {i = 1 }   ^ {m} \rho ( t _ {i} ) $
+and any rooted tree formed by permuting the subtrees  $  t _ {1} \dots t _ {m} $
+of  $  t $
+is isomorphic to  $  t $.
+The rooted trees with at most four nodes are displayed in the table below.
-<table border="0" cellpadding="0" cellspacing="0" style="background-color:black;"> <tr><td> <table border="0" cellspacing="1" cellpadding="4" style="background-color:black;"> <tbody> <tr> <td colname="1" style="background-color:white;" colspan="1"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107040.png" /></td> <td colname="2" style="background-color:white;" colspan="1"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107041.png" /></td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107042.png" /></td> <td colname="2" style="background-color:white;" colspan="1">
+<table border="0" cellpadding="0" cellspacing="0" style="background-color:black;"> <tr><td> <table border="0" cellspacing="1" cellpadding="4" style="background-color:black;"> <tbody> <tr> <td colname="1" style="background-color:white;" colspan="1"> $  t _ {0} = $
+</td> <td colname="2" style="background-color:white;" colspan="1"> $  \emptyset $
+</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"> $  t _ {1} = $
+</td> <td colname="2" style="background-color:white;" colspan="1">
 <img style="border:1px solid;" src="https://www.encyclopediaofmath.org/legacyimages/common_img/" />
-</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107043.png" /></td> <td colname="2" style="background-color:white;" colspan="1">
+</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"> $  t _ {2} = [ t _ {1} ] = $
+</td> <td colname="2" style="background-color:white;" colspan="1">
 <img style="border:1px solid;" src="https://www.encyclopediaofmath.org/legacyimages/common_img/" />
-</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107044.png" /></td> <td colname="2" style="background-color:white;" colspan="1">
+</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"> $  t _ {3} = [ t _ {1} ,t _ {1} ] = $
+</td> <td colname="2" style="background-color:white;" colspan="1">
 <img style="border:1px solid;" src="https://www.encyclopediaofmath.org/legacyimages/common_img/" />
-</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107045.png" /></td> <td colname="2" style="background-color:white;" colspan="1">
+</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"> $  t _ {4} = [ t _ {2} ] = $
+</td> <td colname="2" style="background-color:white;" colspan="1">
 <img style="border:1px solid;" src="https://www.encyclopediaofmath.org/legacyimages/common_img/" />
-</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107046.png" /></td> <td colname="2" style="background-color:white;" colspan="1">
+</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"> $  t _ {5} = [ t _ {1} ,t _ {1} ,t _ {1} ] = $
+</td> <td colname="2" style="background-color:white;" colspan="1">
 <img style="border:1px solid;" src="https://www.encyclopediaofmath.org/legacyimages/common_img/" />
-</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107047.png" /></td> <td colname="2" style="background-color:white;" colspan="1">
+</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"> $  t _ {6} = [ t _ {1} ,t _ {2} ] = $
+</td> <td colname="2" style="background-color:white;" colspan="1">
 <img style="border:1px solid;" src="https://www.encyclopediaofmath.org/legacyimages/common_img/" />
-</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107048.png" /></td> <td colname="2" style="background-color:white;" colspan="1">
+</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"> $  t _ {7} = [ t _ {3} ] = $
+</td> <td colname="2" style="background-color:white;" colspan="1">
 <img style="border:1px solid;" src="https://www.encyclopediaofmath.org/legacyimages/common_img/" />
-</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107049.png" /></td> <td colname="2" style="background-color:white;" colspan="1">
+</td> </tr> <tr> <td colname="1" style="background-color:white;" colspan="1"> $  t _ {8} = [ t _ {4} ] = $
+</td> <td colname="2" style="background-color:white;" colspan="1">
 <img style="border:1px solid;" src="https://www.encyclopediaofmath.org/legacyimages/common_img/" />
@@ Line 50: / Line 113: @@
 Using this bracket notation for rooted trees, one can recursively define elementary differentials, as follows. Let
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107050.png" /></td> </tr></table>
+$$
+F ( \emptyset ) ( y ( x ) ) = y ( x ) , \quad F ( \bullet ) ( y ( x ) ) = f ( y ( x ) ) ,
+$$
-and, for <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107051.png" /> with <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107052.png" /> for <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107053.png" />,
+and, for  $  t = [ t _ {1} \dots t _ {m} ] $
+with  $  \rho ( t _ {i} ) \geq  1 $
+for  $  i = 1 \dots m $,
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107054.png" /></td> </tr></table>
+$$
+F ( t ) ( y ( x ) ) =
+$$
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107055.png" /></td> </tr></table>
+$$
+=
+f ^ {( m ) } ( y ( x ) ) ( F ( t _ {1} ) ( y ( x ) ) \dots F ( t _ {m} ) ( y ( x ) ) ) ,
+$$
-where <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107056.png" />, the <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107057.png" />th derivative of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107058.png" /> with respect to <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107059.png" />, is a multi-linear mapping.
+where  $  f ^ {( m ) } ( y ( x ) ) $,
+the  $  m $
+th derivative of  $  f $
+with respect to  $  y $,
+is a multi-linear mapping.
-Since <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107060.png" /> satisfies <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107061.png" />, it is not hard to show that
+Since  $  y ( x ) $
+satisfies  $  y  ^  \prime  ( x ) = f ( y ( x ) ) $,
+it is not hard to show that
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107062.png" /></td> </tr></table>
+$$
+y ^ {( i ) } ( x ) = \sum _ {t \in T _ {i} } \alpha ( t ) F ( t ) ( y ( x ) ) ,
+$$
-where <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107063.png" /> is the <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107064.png" />th derivative of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107065.png" /> with respect to <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107066.png" />, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107067.png" />, and <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107068.png" /> is the number of distinct ways of labeling the nodes of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107069.png" /> with the integers <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107070.png" /> such that the labels increase as you follow any path from the root to a leaf of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107071.png" />. Consequently, one can write the [[Taylor series|Taylor series]] for <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107072.png" /> as a Butcher series:
+where  $  y ^ {( i ) } ( x ) $
+is the  $  i $
+th derivative of  $  y $
+with respect to  $  x $,
+$  T _ {i} = \{ {t \in T } : {\rho ( t ) = i } \} $,
+and  $  \alpha ( t ) $
+is the number of distinct ways of labeling the nodes of  $  t $
+with the integers  $  \{ 1 \dots \rho ( t ) \} $
+such that the labels increase as you follow any path from the root to a leaf of  $  t $.
+Consequently, one can write the [[Taylor series|Taylor series]] for  $  y ( x ) $
+as a Butcher series:
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107073.png" /></td> <td valign="top" style="width:5%;text-align:right;">(a1)</td></tr></table>
+$$ \tag{a1 }
+y ( x + h ) = \sum _ {i = 0 } ^  \infty   y ^ {( i ) } ( x ) {
+\frac{h  ^ {i} }{i! }
+ } =
+$$
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107074.png" /></td> </tr></table>
+$$
+=
+\sum _ {t \in T } \alpha ( t ) F ( t ) ( y ( x ) ) {
+\frac{h ^ {\rho ( t ) } }{\rho ( t ) ! }
+ } .
+$$
-The importance of Butcher series stems from their use to derive and to analyze numerical methods for differential equations. For example, consider the <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107076.png" />-stage Runge–Kutta formula (cf. [[Runge–Kutta method|Runge–Kutta method]])
+The importance of Butcher series stems from their use to derive and to analyze numerical methods for differential equations. For example, consider the  $  s $-
+stage Runge–Kutta formula (cf. [[Runge–Kutta method|Runge–Kutta method]])
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107077.png" /></td> <td valign="top" style="width:5%;text-align:right;">(a2)</td></tr></table>
+$$ \tag{a2 }
+y _ {n + 1 }  = y _ {n} + h \sum _ {i = 1 } ^ { s }  b _ {i} k _ {i} ,
+$$
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107078.png" /></td> </tr></table>
+$$
+k _ {i} = f \left ( y _ {n} + h \sum _ {j = 1 } ^ { s }  a _ {ij }  k _ {j} \right ) ,
+$$
-for the ordinary differential equation <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107079.png" />. Let <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107080.png" /> be the vector of weights and <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107081.png" /> the coefficient matrix for (a2). Then it can be shown that <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107082.png" /> can be written as the Butcher series
+for the ordinary differential equation  $  y  ^  \prime  ( x ) = f ( y ( x ) ) $.
+Let  $  b = ( b _ {1} \dots b _ {s} )  ^ {T} \in \mathbf R  ^ {s} $
+be the vector of weights and  $  A = [ a _ {ij }  ] \in \mathbf R ^ {s \times s } $
+the coefficient matrix for (a2). Then it can be shown that  $  y _ {n + 1 }  $
+can be written as the Butcher series
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107083.png" /></td> <td valign="top" style="width:5%;text-align:right;">(a3)</td></tr></table>
+$$ \tag{a3 }
+y _ {n + 1 }  = \sum _ {t \in T } \alpha ( t ) \psi ( t ) F ( t ) ( y _ {n} ) {
+\frac{h ^ {\rho ( t ) } }{\rho ( t ) ! }
+ } ,
+$$
-where the coefficients <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107084.png" />, defined below, depend only on the rooted tree <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107085.png" /> and the coefficients of the formula (a2).
+where the coefficients  $  \psi ( t ) \in \mathbf R $,
+defined below, depend only on the rooted tree  $  t $
+and the coefficients of the formula (a2).
-To define <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107086.png" />, first let
+To define  $  \psi ( t ) $,
+first let
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107087.png" /></td> </tr></table>
+$$
+\phi ( \emptyset ) = ( 1 \dots 1 )  ^ {T} \in \mathbf R  ^ {s} ,
+$$
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107088.png" /></td> </tr></table>
+$$
+\phi ( \bullet ) = A \phi ( \emptyset ) ,
+$$
-and then, for <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107089.png" /> with <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107090.png" /> for <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107091.png" />, define <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107092.png" /> recursively by
+and then, for  $  t = [ t _ {1} \dots t _ {m} ] $
+with  $  \rho ( t _ {i} ) \geq  1 $
+for  $  i = 1 \dots m $,
+define  $  \phi ( t ) $
+recursively by
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107093.png" /></td> </tr></table>
+$$
+\phi ( t ) = \rho ( t ) A ( \phi ( t _ {1} ) \dots \phi ( t _ {m} ) ) ,
+$$
-where the product of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107094.png" />'s in the bracket is taken component-wise. Then
+where the product of  $  \phi $'
+s in the bracket is taken component-wise. Then
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107095.png" /></td> </tr></table>
+$$
+\psi ( \emptyset ) = 1,
+$$
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107096.png" /></td> </tr></table>
+$$
+\psi ( \bullet ) = \sum _ {i = 1 } ^ { s }  b _ {i} ,
+$$
-and, for <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107097.png" /> with <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107098.png" /> for <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b11107099.png" />,
+and, for  $  t = [ t _ {1} \dots t _ {m} ] $
+with  $  \rho ( t _ {i} ) \geq  1 $
+for  $  i = 1 \dots m $,
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b111070100.png" /></td> </tr></table>
+$$
+\psi ( t ) = \rho ( t ) b  ^ {T} ( \phi ( t _ {1} ) \dots \phi ( t _ {m} ) ) ,
+$$
-where, again, the product of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b111070101.png" />'s in the bracket is taken componentwise.
+where, again, the product of  $  \phi $'
+s in the bracket is taken componentwise.
-Assuming that <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b111070102.png" /> and that <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b111070103.png" /> for all trees <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b111070104.png" /> of order <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b111070105.png" />, it follows immediately from (a1) and (a3) that
+Assuming that  $  y ( x ) = y _ {n} $
+and that  $  \psi ( t ) = 1 $
+for all trees  $  t $
+of order  $  \leq  p $,
+it follows immediately from (a1) and (a3) that
-<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b111070106.png" /></td> </tr></table>
+$$
+y _ {n + 1 }  = y ( x + h ) + {\mathcal O} ( h ^ {p + 1 } ) .
+$$
-The order of (a2) is the largest such <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/b/b111/b111070/b111070107.png" />.
+The order of (a2) is the largest such  $  p $.
 ====References====