Bulletin of Mathematical Biology Vol. 54, No. 1, pp. 45-58, 1992. Printed in Great Britain.

0092-8240/92 $5.00 + 0.00 Pergamon Press plc © 1991 Society for Mathematical Biology

A MAXIMUM LIKELIHOOD APPROACH TO CORRELATION DIMENSION AND ENTROPY ESTIMATION

E. OLOFSEN, J. DE GOEDE and R. HEIJUNGS
Department of Physiology, University of Leiden, P.O. Box 9604, 2300 RC Leiden, The Netherlands (E-mail: [email protected])

To obtain the correlation dimension and entropy from an experimental time series, we derive estimators for these quantities, together with expressions for their variances, using a maximum likelihood approach. The validity of these expressions is supported by Monte Carlo simulations. We illustrate the use of the estimators with a local recording of atrial fibrillation obtained from a conscious dog.

1. Introduction. At present there is considerable interest in analysing experimental time series using methods from nonlinear dynamical systems theory. For a quantitative characterization of dynamical systems from a measured signal, algorithms have been developed to estimate the dimension spectrum and the generalized Kolmogorov entropies (Broggi, 1988; Schuster, 1988). These quantities can be used to differentiate between (quasi-)periodic, chaotic and stochastic processes. Time series from living organisms usually show irregular behaviour. It is interesting to determine whether the underlying dynamics is low-dimensional and chaotic, because then it can in principle be modelled by a small set of deterministic nonlinear differential equations, implying that the irregularities are an intrinsic part of the process. Examples could be the activity of the brain and the functioning of the heart, in both health and disease. The correlation integral method (Grassberger and Procaccia, 1983b; Takens, 1983) is widely used to estimate the correlation dimension and entropy. A sufficient condition for chaos is that the correlation entropy is positive. Thus, to identify chaos, we need a measure of the uncertainty with which the correlation entropy can be estimated. Although attention has been given to the statistical error of estimators of the correlation dimension (Denker and Keller, 1986; Ramsey and Yuan, 1989; Abraham et al., 1990; Theiler, 1990a; 1990b), this is not so for estimators of the correlation entropy. This paper is organized as follows. In Section 2, we describe the correlation integral method. In Section 3 we will derive estimators of the correlation



dimension and entropy, together with expressions for their uncertainties, using the maximum likelihood approach. We demonstrate their validity by Monte Carlo simulations in Section 4. An example of the use of the expressions, for a recording of atrial fibrillation, is given in Section 5 and, finally, some conclusions are given in Section 6.

2. The Correlation Integral. We start with a brief description of the correlation integral method, since it is the basis of our maximum likelihood approach. To characterize a dynamical system from a time series x(t), first the phase space of the system is reconstructed with the method of time-delayed coordinates (Takens, 1981; Packard et al., 1980), i.e. with M vectors:

\mathbf{x}(t_i) = [x(t_i),\; x(t_i + l\,\Delta t),\; \ldots,\; x(t_i + (d-1)\,l\,\Delta t)]    (1)

where Δt is the sample time, d is the embedding dimension and l is an appropriate lag. The correlation integral C(r) is defined as:

C(r) = M^{-2}\,\#\{\text{pairs } (i,j) \text{ with } \|\mathbf{x}(t_i) - \mathbf{x}(t_j)\| \le r\}    (2)

where ‖·‖ denotes a norm.
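As a concrete illustration (not part of the original paper), the reconstruction (1) and the correlation integral (2) could be computed with a minimal Python sketch such as the following; the function names, the choice of the maximum norm and the quadratic-time pair count are our own assumptions, suitable only for small M.

```python
import numpy as np

def delay_embed(x, d, lag):
    """Time-delay reconstruction: rows are x(t_i) = [x(t_i), x(t_i + lag*dt), ..., x(t_i + (d-1)*lag*dt)]."""
    m = len(x) - (d - 1) * lag              # number of reconstructed vectors M
    return np.array([x[i:i + d * lag:lag] for i in range(m)])

def correlation_integral(vectors, r):
    """C(r) = M^-2 * (number of pairs (i, j) with ||x_i - x_j|| <= r), using the maximum norm."""
    m = len(vectors)
    count = 0
    for i in range(m):
        for j in range(i + 1, m):
            if np.max(np.abs(vectors[i] - vectors[j])) <= r:
                count += 1
    return 2.0 * count / m**2               # ordered pairs with i != j, so each unordered pair counts twice
```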

It is assumed that the distances r between two randomly chosen points in phase space obey the cumulative probability distribution function (Grassberger and Procaccia, 1983b):

P(r) = \phi\, r^{\nu} = \psi \exp(-d\,l\,\Delta t\,K_2)\, r^{\nu}    (3)

and that C(r) ≈ P(r) for sufficiently small r and large M. We also assume that φ and ψ do not depend on r (see, however, Theiler, 1988) and that ψ does not depend on the embedding dimension. The correlation dimension ν is now given by (Eckmann and Ruelle, 1985):

\nu = \lim_{r \to 0}\, \lim_{M \to \infty} \frac{\ln C(r)}{\ln r}    (4)

if d is large enough (Takens, 1981). The correlation entropy K₂ is given by:

\lim_{r \to 0}\, \lim_{d \to \infty}\, \lim_{M \to \infty} \frac{1}{d}\,\ln C(r) = -\,l\,\Delta t\,K_2    (5)
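For the reader's convenience we note (this one-line verification is our addition) that the limits (4) and (5) follow at once from the assumed form (3):

```latex
\ln C(r) \approx \ln P(r) = \ln\psi - d\,l\,\Delta t\,K_2 + \nu \ln r ,
\qquad\text{so}\qquad
\frac{\ln C(r)}{\ln r} \to \nu \;\; (r \to 0), \qquad
\frac{\ln C(r)}{d} \to -\,l\,\Delta t\,K_2 \;\; (d \to \infty).
```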

With experimental data, a "scaling region" [r_l, r_u] must be identified, for which the distribution (3) is valid. The dimension can be estimated by the slope of a straight line through a number of points (ln r_i, ln C(r_i)) in the scaling region. The entropy can be estimated by observing the behaviour of C(r) as a function of d, but using equation (5) directly may result in slow convergence. Therefore one usually studies the quotient of two correlation integrals, at embedding dimensions d and d + e (Grassberger and Procaccia, 1983a):

\ln\!\left(\frac{C_d(r)}{C_{d+e}(r)}\right) = e\,l\,\Delta t\,K_2    (6)
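A hedged sketch of how these conventional estimates might be formed in practice, reusing the helper functions above; the least-squares fit for the slope and the averaging over the scaling region are our own, unoptimized choices:

```python
import numpy as np

def dimension_and_entropy(x, d, e, lag, dt, r_values):
    """Estimate nu from the slope of ln C_d(r) versus ln r, and K2 from equation (6),
    with r_values taken inside a previously identified scaling region [r_l, r_u]."""
    c_d  = np.array([correlation_integral(delay_embed(x, d,     lag), r) for r in r_values])
    c_de = np.array([correlation_integral(delay_embed(x, d + e, lag), r) for r in r_values])
    nu, _ = np.polyfit(np.log(r_values), np.log(c_d), 1)    # slope over the scaling region
    k2 = np.mean(np.log(c_d / c_de)) / (e * lag * dt)       # equation (6)
    return nu, k2
```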

We will now show how to obtain maximum likelihood estimators for both ν and K₂, together with their asymptotic variances, under the assumption that the finite number of distances is the only source of error.

3. Maximum Likelihood Methods. For the correlation dimension, maximum likelihood estimators have been derived by Takens (1985) and Ellner (1988). In their approach, the distribution function (3) is normalized such that P(r_u) = 1, with the consequence that the correlation entropy cannot be determined. Now suppose we have a sample that consists of N independent distances, with N_l in the interval [0, r_l], N_s in [r_l, r_u] and N_u in [r_u, 1]. The likelihood function for this doubly censored set of data is (Kendall and Stuart, 1979):

L(\nu, p) = A\,\bigl[p\,(r_l/r_u)^{\nu}\bigr]^{N_l}\;\prod_{i=1}^{N_s} \frac{p\,\nu\,r_i^{\nu-1}}{r_u^{\nu}}\;[1 - p]^{N_u}    (7)

where A is a permutation coefficient and p = φ r_u^ν is used for convenience. We now consider the case where the sample consists of N_d distances calculated at embedding dimension d, and N_{d+e} distances calculated at embedding dimension d + e. We also assume that the N_d and N_{d+e} distances are independent. The likelihood function for this case, L(ν, p_d, p_{d+e}), is the product of the likelihood functions L_d(ν, p_d) and L_{d+e}(ν, p_{d+e}). By solving the likelihood equations (Kendall and Stuart, 1979), we find the maximum likelihood estimators of the parameters ν, p_d and p_{d+e}. These are:

\hat{\nu} = -\,\frac{N_{s,d} + N_{s,d+e}}{\sum_{i=1}^{N_{s,d}} \ln\!\frac{r_i}{r_{u,d}} + N_{l,d}\,\ln\!\frac{r_{l,d}}{r_{u,d}} + \sum_{j=1}^{N_{s,d+e}} \ln\!\frac{r_j}{r_{u,d+e}} + N_{l,d+e}\,\ln\!\frac{r_{l,d+e}}{r_{u,d+e}}}    (8)

and:

\hat{p}_d = \frac{N_{l,d} + N_{s,d}}{N_d}    (9)

with a similar expression for p̂_{d+e}. The maximum likelihood estimator of the correlation entropy is:



\hat{K}_2 = \frac{1}{e\,l\,\Delta t}\,\ln\!\left(\frac{\hat{p}_d\; r_{u,d+e}^{\hat{\nu}}}{\hat{p}_{d+e}\; r_{u,d}^{\hat{\nu}}}\right)    (10)

where we used equation (3) and the property that a function of maximum likelihood estimators is itself a maximum likelihood estimator. The asymptotic variances of the dimension and entropy estimators are obtained by inverting the information matrix (Kendall and Stuart, 1979). We find:

\mathrm{var}(\hat{\nu}) = \frac{\hat{\nu}^2}{N_d\,\hat{p}_d\bigl(1 - (r_{l,d}/r_{u,d})^{\hat{\nu}}\bigr) + N_{d+e}\,\hat{p}_{d+e}\bigl(1 - (r_{l,d+e}/r_{u,d+e})^{\hat{\nu}}\bigr)}    (11)

and:

\mathrm{var}(\hat{K}_2) = \frac{1}{(e\,l\,\Delta t)^2}\left[\frac{1 - \hat{p}_d}{N_d\,\hat{p}_d} + \frac{1 - \hat{p}_{d+e}}{N_{d+e}\,\hat{p}_{d+e}} + \mathrm{var}(\hat{\nu})\,\ln^2\!\left(\frac{r_{u,d}}{r_{u,d+e}}\right)\right]    (12)

If r_{u,d} = r_{u,d+e}, then ν̂ and K̂₂ are uncorrelated. Moreover, the estimator of the entropy is equivalent to equation (6) if one substitutes r = r_{u,d} = r_{u,d+e} and if the correlation integrals are based on independent distances. The maximum likelihood estimator of the correlation dimension calculated at a single embedding dimension reads:

\hat{\nu}_d = -\,\frac{N_{s,d}}{\sum_{i=1}^{N_{s,d}} \ln\!\frac{r_i}{r_{u,d}} + N_{l,d}\,\ln\!\frac{r_{l,d}}{r_{u,d}}}    (13)

Its asymptotic variance is given by:

\mathrm{var}(\hat{\nu}_d) = \frac{\hat{\nu}_d^2}{N_d\,\hat{p}_d\bigl(1 - (r_{l,d}/r_{u,d})^{\hat{\nu}_d}\bigr)}    (14)
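A minimal Python sketch of the pooled estimators (8)-(12) is given below, assuming the supplied distances are independent and scaled to [0, 1]; all names are our own and the routine is meant only as an illustration of the formulas, not as the authors' implementation. The single-embedding estimator (13) and its variance (14) follow from the same ingredients by dropping the terms belonging to dimension d + e.

```python
import numpy as np

def censor(distances, r_l, r_u):
    """Split a sample of distances into N_l, the values inside [r_l, r_u], and N_u."""
    d = np.asarray(distances)
    n_l = int(np.sum(d <= r_l))
    r_s = d[(d > r_l) & (d <= r_u)]
    n_u = int(np.sum(d > r_u))
    return n_l, r_s, n_u

def ml_estimates(dist_d, dist_de, r_l_d, r_u_d, r_l_de, r_u_de, e, lag, dt):
    """Maximum likelihood estimates of nu and K2 with asymptotic variances, equations (8)-(12)."""
    n_l_d, r_d, _ = censor(dist_d, r_l_d, r_u_d)
    n_l_de, r_de, _ = censor(dist_de, r_l_de, r_u_de)
    n_d, n_de = len(dist_d), len(dist_de)

    # Equation (8): pooled estimate of the correlation dimension
    denom = (np.sum(np.log(r_d / r_u_d)) + n_l_d * np.log(r_l_d / r_u_d)
             + np.sum(np.log(r_de / r_u_de)) + n_l_de * np.log(r_l_de / r_u_de))
    nu = -(len(r_d) + len(r_de)) / denom

    # Equation (9): estimates of p_d and p_{d+e}
    p_d = (n_l_d + len(r_d)) / n_d
    p_de = (n_l_de + len(r_de)) / n_de

    # Equation (10): correlation entropy estimate
    k2 = np.log(p_d * r_u_de**nu / (p_de * r_u_d**nu)) / (e * lag * dt)

    # Equations (11) and (12): asymptotic variances
    var_nu = nu**2 / (n_d * p_d * (1 - (r_l_d / r_u_d)**nu)
                      + n_de * p_de * (1 - (r_l_de / r_u_de)**nu))
    var_k2 = ((1 - p_d) / (n_d * p_d) + (1 - p_de) / (n_de * p_de)
              + var_nu * np.log(r_u_d / r_u_de)**2) / (e * lag * dt)**2
    return nu, k2, var_nu, var_k2
```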

These equations are slight generalizations of Ellner's results. Note that the expressions for the "double" correlation dimension [equation (8)] and entropy [equation (10)] are only meaningful if the "single" correlation dimensions ν̂_d and ν̂_{d+e} [equation (13)] do not significantly differ.

4. Monte Carlo Simulations. Monte Carlo methods can be used to investigate properties of the distributions of random variables. Here we will study the properties of the derived maximum likelihood estimators in two interesting



cases: (1) using distances drawn directly from the proposed distribution [equation (3)]; and (2) using distances between points in reconstructed phase space, using time series of a (simulated) chaotic system.

4.1. Simulations using "ideal data". From 1000 Monte Carlo trials of the dimension and entropy estimates using simulated data, i.e. independent distances drawn from the distribution [equation (3)], the means and variances were estimated. These were compared with the specified ν and K₂ and with the asymptotic variances. The estimators and the expressions for the asymptotic variances appear to be accurate if both N_{l,d} + N_{s,d} and N_{l,d+e} + N_{s,d+e} are larger than about 50.

4.2. Simulations using time series. Time series were generated using the following classical examples:

1. The Hénon map, governed by the equations (Grassberger et al., 1988; Hénon, 1976):

x_{n+1} = 1 - a\,x_n^2 + b\,y_n, \qquad y_{n+1} = x_n    (15)

with a = 1.4 and b = 0.3; x₀ and y₀ were chosen uniformly in [-1, 1] and [-0.1, 0.1], respectively. Initial conditions within this area cause the iterates to approach the attractor (Hénon, 1976). Literature values are: ν = 1.22 and K₂ = 0.325 (Grassberger and Procaccia, 1983a).

2. The logistic map, governed by the equation (Grassberger et al., 1988):

x_{n+1} = 1 - a\,x_n^2    (16)

For a = 2, analytical results (Grassberger et al., 1988) are ν = 1 and K₂ = ln 2. x₀ was chosen uniformly in [-1, 1], so that any x₀ is near the attractor.

3. The sine wave:

x_n = \sin(\alpha n + \varphi_0)    (17)

with α = 2/π, resulting in approximately 10 points per cycle, and φ₀ chosen uniformly in [0, 2π]. The values for ν and K₂ are 1 and 0, respectively. The x-variable was used to generate a time series of length L (Δt = 1); the first L_s iterates were discarded to avoid transient effects.

4.2.1. Effects of correlations between distances. For the derivation of the maximum likelihood estimators of the correlation dimension and entropy and their variances, we had to assume that the distances used are independent. The question now arises how many distances are independent if we have P



independent points in phase space. Ellner (1988) states that these distances must be calculated from non-overlapping collections of points, so that:

N = P/2    (18)
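One way such non-overlapping distances could be obtained in practice is sketched below; the random pairing and the maximum norm are our own choices, intended only to illustrate the idea of disjoint pairs:

```python
import numpy as np

def independent_distances(vectors, rng=None):
    """Form N = P // 2 distances from disjoint (non-overlapping) pairs of reconstructed points."""
    rng = np.random.default_rng() if rng is None else rng
    idx = rng.permutation(len(vectors))      # random pairing of the P points
    half = len(vectors) // 2
    a, b = idx[:half], idx[half:2 * half]
    return np.max(np.abs(vectors[a] - vectors[b]), axis=1)
```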
