Video-scanning system for measurement of lip and jaw motion* Martin J. McCutcheon, Samuel G. Fletcher, and Akira Hasegawa Departmentof Biocommunication,Universityof Alabama in Birmingham, UniversityStation,Birmingham, Alabama

35294

(Received 26 October 1976; revised 20 December 1976)

An instrument for measuringlabial and mandibular positionsduring speechis described.A modified

videosystemis usedto measurethe positions of smallreflectingglassbeadsattachedat selected poihtson the face and to a thin wire anchored on a mandibular tooth. The system allows continuousdata collection from up to 16 reflector points at a sampling rate of 100 Hz. Reflector movementswith instantaneous velocitiesup to 45 cm/sec can be determined with a 0.3-mm resolution. Techniquesfor data acquisition and analysisand displayare described. PACS numbers: 43.70.Ny, 43.70.Bk

camera and monitor modified for a 100-Hz framing

INTRODUCTION

Recent work in speech physiology has seen the development of a number of new research tools for the study of labial and mandibular functions. One reason for this

developmentis the necessityfor large volumesof data to determine strategies used in execution of coordinated movements of the articulators. Because of radiation, use of cinefluorographic technique may be contraindicated for this purpose. Apparatuses developed for dental studies typically are bulky and preclude simultaneous

studyof labial motions•-s andare therefore not well suited for speech physiological investigations.

Strain-gauge devices developed by Sussman and

Smith4 andby AbbsandGilbert• havebeenusedfor the study of labial and mandibular movements.

Also, Ohala

et al. e reportedsomepreliminaryresultsusinga photoelectric technique to transduce mandible movements. More recently, a low-dosage x-ray microbeam system has been developed to track small lead pellets attached

to the speecharticulators.7 It still introducesa definite risk

factor

however.

The instrumentation described in this paper is part of a custom-designed system for synchronous acquisition of data from labial, lingual, mandibular, and acoustic

sources.e As illustratedin theblockdiagramof Fig. 1, the system consistsof (1) a palatometer which detects tongue-to-palate contact with electrodes imbedded in a .very thin pseudopalatal plate that adheres to the

subject'shard palate, (2) a modified video system (gnathometer) that detects the locates the position of small light reflectors attachedto the speaker's face, (3) a

rate, (2) a video processor, (3) a set of X and Y coordinate storage registers interfaced to a P DP 11/40 computer. An internal clock provides a common time reference for all components of the total instrumentation system. ,

During data acquisition, the subject's face is illuminated with a uniform distribution of light and reflections from the small beads on the face form light spots on

the camera sensor. The camera utilizes a scanning process for converting the intensity of the spots into sharp peaks in the signals fed to the video processor. Scanning is effectively horizontal with 256 lines per field, with a framing rate of 100 Hzo

The video processor contains circuitry for detecting the peaks in the video signal, a network of adaptive gates, and two eight-bit X and Y counters synchronized to the system. The adaptive gate network is used to distinguish reflector data from spurious noise peaks in the video signals and to provide a unique set of coordinates for each reflector. During scanning, output signals from the gates cause data in the counters to be shifted to temporary

storage registers.

The registers

can

store data for up to 16 points per frame. The order of storage of coordinate values in the registers is determined by orientation of the points in the field with respect to the path of the scanning process. After scanning is completed, the coordinate values, together with data from other acquistion devices, are

parallel filter bank for spectral analysis of incoming

speechsignals, (4) a P DP 11/40 computerfor the control and acquisitionof data, and (5) data-display devices

.JGNATHO1

RA-[METER 1 ' •

that can be used in real time as monitors or as playback devices driven by the computer.

JIOEO•

The palatometer has been reported previously by

Fletcher, McCutcheon, andWolf.9 This paperwill be

-mGRAPH m m PDP mmy4ømm

concerned primarily with the system used for measuring labial and mandibular positions. I.

SYSTEM

DESCRIPTION PSUEO0PALATE

A block diagram of the video system is shown in Fig.

2. Its major componentsare: (1) a COHU model 6150 1051

J. Acoust. Soc. Am., Vol. 61, No. 4, April 1977

FIG. 1.

LED DISPLAY

Block diagram of total instrumentation system.

Copyright¸ 1977 by the AcousticalSociety of America

1051

Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 130.113.111.210 On: Fri, 19 Dec 2014 10:12:45

1052

McCutcheon, Fletcher,and Hasegawa: Videoscanning of lip andjaw motion

1052

CLOCK

AND

SYNC

TO

COMPUTER COUNTERS COORDINATE

8 BIT X

ADAPTIVE CAMERA

DETECTOR

STORAGE

FIG. 2. Block diagram of the video-scanning system.

8BITY

GATES

REGISTERS VIDEO i

PROCESSOR

FROM COMPUTER

DIGITAL COMPARATORS

transferred to a memory buffer in a PDP 11/40 computer.

The system throughout rate is sufficient to allow continuous data acquisition for up to 20 rain as data are im-

mediately transferred from the buffer memory to either magnetic tape or disk storage. To facilitate adjustments during data acquisition, signals from intermediate processing stages can be selectively mixed with the camera video and displayed as superimposed images on the monitor.

The monitor can also be operated as a display device driven by the computer with the necessary D/A conversion accomplishedby digital comparators (Fig. 2). The data then appear on the monitor screen as intensified

light spots. By controlling the rate at which data are updated by the computer, the display can be operated at a normal rate or in slow motion. Data acquisition and

display can also be carried out in a test mode, independent of the computer. II.

SYSTEM

PERFORMANCE

System performance is determined by a number of factors: lighting intensity and distribution, reflector characteristics, lens optics, camera sensor, scanning characteristics, The minimum

and the electronics. reflector

size that can be detected

de-

pends on the magnification provided by the optical system. The hardware detection scheme requires that the reflector

lines.

image subtend two or more successive scan

The reflectors are l-ram diam. glass beads

coated with glossy white enamel paint and attached to the skin with a medical

adhesive.

Their

small

size and

weight(0.05 g) afford minimummassloadingto the soft tissues of the lips and other structures.

Detection of

moving reflectors dependsprimarily on the dynamic response and scanning characteristics

of the camera sen-

sor. To evaluate the system's ability to locate a moving point, a number of, tests were run using a motor

FIG. 3. Anatomical reference points and coordinate axes in the midsaggitalplane. Points O and M are at the tips of the maxillary and mandibularincisors, respectively, andpoint P is located on the surface of the skin overlying the fronto-nasal suture.

J. Acoust. Soc. Am., Vol. 61, No. 4, April 1977

Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 130.113.111.210 On: Fri, 19 Dec 2014 10:12:45

1053

McCutcheon, Fletcher,and Hasegawa: Video scanning of lip andjaw motion

1053

secß Maximum instantaneous velocity in the X and Y

directions is 45 cm/sec. For measurements during speech, points on the lower lip are the most critical since the mandible and upper

lip move with slower velocities. The limiting 45-cm/sec velocity exceeds the maximum lower-lip

velocities of 30

cm/sec reportedby AbbsandGilbert• andapproximately 25 cm/secby Sussman et al. •o System linearity is about 2%. Stability tests con-

H!

ducted indicate no measurable instability for stationary reflectors. For moving reflectors, however, there is some variation in position as a result of nonuniformities in lighting and surface conditions of the painted beads. An estimate

of the maximum

error

in this

case is + 0.5

mm, or the bead radius. III.

•H 2 J2

DATA

ACQUISITION

In our kinematic studies of the lips and mandible we have for the most part used a profile view with reflec-

tors in the midsaggital plane of the subject's face. Point

O in Fig. 3 serves as a maxillary reference and point M on the edge of a lower incisor serves to characterize motions of the mandible. The position of point M on the mandibular

incisor

is calculated

from

measures

of re-

flectors located at points J• and J•. on a rigid wire at-

tachedby an orthodonticband to a canine tooth (see Fig. 4). The wire is shapedto exit near the corner of the FIG. 4. Typical location of reflectors used to determine articulator position and motionß

driven disk containing the 1-mm-diam. reflectors. The result of these tests clearly demonstrated that the positions of a reflector moving with maximum velocities up

to 45 cm/sec (1500 lines/sec) can be determined with a 0.3-mm resolution. Figure 5 illustrates measured horizontal and vertical positions of a disk-mounted re-

flector rotating at a constantangular velocity of 30 rad/

mouth at the level of labial margins then curved to place the reflectors in the midsaggital plane. Experiments indicate that vibrations of the short wire are sufficiently low in amplitude to be neglected. To determine geometric relationships between calculated and measured points, data are taken with teeth edge to edge and with a temporary reflector at the anterior edge of their contact. For these reference data, in effect points O and M coincide.

After speech data have been taken, the following calculations are made at each sample interval ø

Vertical

15

ß

o

ß

o

FIG.

-15

5.

Vertical

and horizontal

positions in mm versus time of a reflector rotating with a constant

angular velocity of 30 tad/sec.

Horizontal

Points represent samples taken at 10-msec

o•

ß

intervals.

ß

ß

-15

•o.

J. Acoust. Soc. Am., Vol. 61, No. 4, April 1977

Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 130.113.111.210 On: Fri, 19 Dec 2014 10:12:45

1054

McCutcheon,Fletcher,and Hasegawa:Video scanningof lip and jaw motion

UPPER

LIP

from

1054

the reference

measurements.

Equations (1) and (2) express the calculation in terms of the translational and rotational componentsof jaw

SKULL CANTILEVER

motion.

'U •li

USER DEFINED

I •

L

If the head is not restrained, simultaneous measurements and calculations are needed to determine the po-

POLYGONFOR THE OF

IDENTIFICATION

ß•

sition of point O from the measured positions of H• and //2 on a skull cantilever (see Fig. 5).

J2COORDINATE VALUES _

IV.

MANDIBLE

DATA

The reflector coordinates obtained during data acquisition are stored on a first-found, first-stored basis. Through the use of an off-line interactive program the

CANTILEVER

coordinates

Cumulative plot of reflector

coordinates over time.

The data arefor 15 repetitions ofthetestword

of a reflector

are identified

and sorted.

The interactive feature allows considerable flexibility in placement of the reflectors.

LOWER LIP

FIG. 6.

PROCESSING

[afa].

cosA- Y• sinA,

(1)

In the initial part of the program, reflector coordinate data are cumulatively plotted on a CRT storage screen as shown in Fig.. 6. This display illustrates the range of movement that took place for 15 repetitions of the ut-

Ym=Y•x + Y•u cosA +X•u sinA,

(2)

terance [a/a] from a profile view. Denser regions indi-

X,,,,=X.n +X•

cate slower moving or relatively stationary reflectors. The less-dense regions are associated with less-frequent and more-rapid movements.

where X•, Ymare the X and Y coordinates of the mandible point M, •x is the angle of rotation of the jaw from

iti referenceposition,X•x, Y•x are the measuredX and Y coordinatesfor point Jx; and X•M, Y•M are the X and

To determine the time history of a specific reflector, the associated cluster of data is enclosed by a user de-

Y components of the vector from J• to M as computed

12.5

10.0

HORI ZONTALOI SPLACEMENT (MM.)

o-

5.0

7.5

I

I

UPPER LIP

....... -.............. '--.' .... .... .... ß..............

IO.0

I

7.5

LOWER LIP

-

ß

ø

ø

' ß

.

.

/tOO

LOWER INCISOR

..'"-..

15--

5000

--

•ooo

--

"

LI P-TOOTH

', ....

'-'....

CONTACT

..............

'.:.... :L... :-:... :,•... • ... •...

F I LTERBANK

• •...

.....................................

• '

•...

;

Video-scanning system for measurement of lip and jaw motion.

Video-scanning system for measurement of lip and jaw motion* Martin J. McCutcheon, Samuel G. Fletcher, and Akira Hasegawa Departmentof Biocommunicatio...
524KB Sizes 0 Downloads 0 Views