Diagnosis of Endometrial Cancer Based on Back ... - IEEE Xplore

0 downloads 0 Views 431KB Size Report
Abstract. Endometrial cancer is one of the most common cancers in women worldwide. Early stage and accurate diagnosis is indispensable for treatment of ...
2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery

Diagnosis of endometrial cancer based on back-propagation neural network and near-infrared spectroscopy of tissue

Yuhong Xiang, Jing Tian, and Zhuoyong Zhang *

Yinmei Dai

Department of Chemistry, Capital Normal University,

Beijing Obstetrics and Gynecology Hospital, Affiliated to

Beijing 100048, P.R. China

Capital Medical University,

Corresponding E-mail address: [email protected]

Beijing 100006, P.R. China

Abstract

comparing with the data of 6000 deaths in 1996 [2]. The

Endometrial cancer is one of the most common

new case is more common in women who are older, white,

cancers in women worldwide. Early stage and accurate

affluent, obese, and of low parity, and especially women

diagnosis is indispensable for treatment of endometrial

with postmenopausal bleeding should be screened for

cancer patient. In this study, near-infrared spectra of 18

endometrial

normal, 30 hyperplasia and 29 malignant pathological

measurements of endometrial thickness and hysteroscopy

sections were collected. The original spectra were

have also been studied, biopsy sampling is currently the

pretreated by using smoothing, denoising, and data

most accurate and widely used screening technique [3]. But

compression methods, 6 principal components were

the diagnosis of biopsy sample has some limitations. For

extracted as the input of back propagation neural network

example, it is difficult to distinguish between the

(BPNN). The number of hidden neurons, learning rate,

endometrial cancer and atypical endometrial hyperplasia [3].

momentum, and learning epochs were optimized based on

Therefore, it is necessary to develop an accurate, fast,

the RMSE of leave-one-out cross validation (LOOCV). The

convenient and inexpensive method to diagnose the

optimal model of BPNN built can successfully classify the

endometrial cancer at early stage.

cancer.

Although

ultrasonographic

samples into three groups. The results showed that BPNN

Near-infrared spectroscopy (NIRS) has been known as

coupling with NIR spectroscopy can provide an efficient

an accurate, fast and inexpensive method for pathology

method for the early diagnosis of endometrial cancer.

studies. NIR spectra (750–2500 nm) can be assigned mostly to overtones and combinations of the molecular vibrations

Keywords-endometrial cancer; near-infrared spectrum;

of C–H, N–H, and O–H groups. These groups are

back propagation neural network;

composition of all biological molecules (lipids, proteins,

1. Introduction

carbohydrates, and water) [4, 5]. Cancer tissues differ from their normal parts in their composition. Any alteration in the composition of the tissues can be detected and used for

Endometrial cancer is one of the most common

diagnostic purposes.

gynaecological cancers. The data of the National Cancer

NIR

Institute showed 40100 new cases and 7470 deaths from

spectroscopy

technology

has

important

endometrial cancer in United States in 2008 [1]. Deaths

applications in cancer diagnosis. Most of researches dealing

from endometrial cancer are prone to continue rising

with human tissue were on breast cancer [6-9]. Besides

978-0-7695-3735-1/09 $25.00 © 2009 IEEE DOI 10.1109/FSKD.2009.470

508

breast cancer, the diagnoses of prostate cancer [10, 11],

were supplied by Beijing Obstetrics and Gynecology

pancreatic and colorectal cancer [4, 12] have also been

Hospital, Affiliated to Capital Medical University. The

reported. The method of cancer diagnoses above was to

mean age is 46 with the oldest 71 years and the youngest 19

quantify tissue chromophores or differentiate between the

years. All the endometrial tissues were fixed in 4%

absorption and scattering parameters. However, peaks in

formaldehyde, embedded in paraffin wax, section, xylol

many near-infrared spectra are overlapped, and in some

deparaffinization, dehydration with a series of increasing

cases, sloping baselines of spectra are observed. Therefore,

alcohol concentrations, adhibit, neural gum mounting. The

it is difficult to assign NIR absorbance bands. Computer

thickness of all the paraffin section was assumed to be 4

technology coupling with chemometrics can provide a

µm. The gold coated mirror onto the paraffin sections were

powerful tool for NIR applications [13-16]. Neural networks are mathematical models inspired by

placed on the integrating sphere. The NIR diffuse reflection

biological neurons. Back propagation neural networks

spectra were collected by the FT-NIR spectrometer

(BPNNs) are feed-forward network trained by propagation

controlled by Thermo Fisher Omnic software of version 7.3

of error back through the networks. Neural networks are a

with optical resolution of 4 cm− over the spectral range of

better method for pattern recognition and calibration than

4500 and 10000 cm− . A background spectrum was recorded

conventional statistical methods because of their advantages

using air reference at 25°. Five times was scanned for each

with

self-organization,

self-study,

self-adoption,

1

1

section on different positions, the mean was used. All the

and

NIR spectra of 77 paraffin sections were shown in Fig. 1.

unusual fault tolerance. In this work, NIR spectra of a total of 18 normal, 30 hyperplasia, and 29 malignant tissue slices were collected. The

original

Savitzky-Golay

spectra

were

smoothing

and

by

using

multiplicative

scatter

pretreated

correction (MSC). According to the standard deviation spectra of 77 pathological sections of endometrial tissues, the data of NIR spectra in the region of 4000-4600 cm-1 were selected as candidate input invariables. Principal component analysis (PCA) was used to compress the spectral data. The first 6 principal components were employed as the input variables of BP neural network. In the process of building BPNN, the RMSE of leave-one-out

Fig. 1. The NIR spectra of 77 tissue sections

cross validation was used as the goal function to optimize the number of hidden neurons, learning rate, momentum

From the Fig. 1, it can be seen that it is difficult to

and learning epochs. The optimal BP neural network built

distinguish the difference of NIR spectra of normal,

can successfully classify the samples into three groups. The

hyperplasia, and malignant endometrial tissues.

results showed that the BP neural network coupling with

In this study, according to the standard deviation

NIR spectra of endometrial tissue can provide an efficient

spectra of 77 pathological sections, the NIR spectra in

method for the early diagnosis of endometrial cancer.

region of 4000-4600 cm

-1

were selected for further

identification. NIR spectra of all the samples were firstly

2. Material and method

smoothed by the Savitzky-Golay (polynomial function order 3, smooth windows 5) method. Then, MSC was

Seventy seven paraffin sections of endometrial tissues

applied to correct spectra for spectral noise and background

509

effects which cause baseline shifting and tilting. Principal

input and output variables. However, if the number of

component analysis was employed to reduce the number of

hidden layer neuron is too large, the experimental error

input variables. The first 6 principal components accounting

(noise) will be introduced into the BPNN. In our work, the

for 99.97% original information of NIR spectra were

optimization process of neurons in hidden layer was shown

selected as the input variables of BP neural network. The

in Fig. 2.

transfer functions of neurons in hidden layer and out layer were ‘‘tansig’’ and ‘‘purelin’’, respectively. The samples were divided into two sets, training set and test set, by using Kennard-Stone method [17]. The training set including 58 samples was used to build the BP neural network model, and the 19 samples were used to evaluate the model. In the process of building BP neural network, leave-one-out cross validation was used to estimate the generalization properties of statistical models. Binary encoding, (1 0 0), (0 1 0) and (0 0 1), were used to express the diagnosis results for normal, hyperplasia, and malignant, respectively.

Fig. 2. Influence of the number of neurons in hidden layer on network

3. Results and Discussions

Note: the classification accuracy of leave-one-out cross validation (LOOCV) and test set were plotted with ○ and ●, and the

Traditionally, BPNN consists of three layers. In this

RMSE of LOOCV and test set were plotted with ▽ and ▼,

work, the traingdm function was used to train the network.

respectively (similarly hereinafter in below figures).

The optimal neural network model was obtained by

From the Fig. 2, it can be seen that the BPNN with 5

optimization of the number of neurons in hidden layer,

hidden neurons had the lowest root mean square error of

learning rate, momentum, and learning epochs. The goal

leave-one-out cross validation (RMSECV). And the

function RMSE of leave-one-out cross validation was used

classification accuracy (CA) rate for leave-one-out cross

to optimize the architecture and parameters of BPNN.

validation (LOOCV) and test set were 100% and 94.74%,

RMSE was calculated according to (1)

respectively.

The influence of learning rate (1) The

Where, yci and yei are calculated and experimental values,

learning

rate

is

an

important

parameter

influencing the stability and training speed of the BPNN. If

respectively, and p is the number of samples.

learning rate is too small, the slow convergence and local minima may be happen. However, if learning rate is too

The influence of hidden neural neurons

large, the convergence speed increases but the performance of neural network can not work well because the network

The number of hidden layer neurons plays a crucial

may jump over the global minimum. The optimization

role on the generalization ability of BPNN. If the number of

process of learning rate was shown in Fig. 3.

hidden layer neuron is too small, neural network can not simulate the complicated function relationships between

510

Learning epochs also influence the accuracy of BPNN. If the number of learning epoch is too small, the desired convergence will not be reached. Whereas, overfitting will be happen. The optimization process of learning epochs was shown in Fig. 5, in which the learning epoch varied from 1000 to 10000. From the Fig. 5, it can be seen that 7000 epochs with the lowest RMSECV was considered as the optimal learning epochs. The CA rate of leave-one-out cross validation and the test set were 100% and 94.74%, respectively. Fig. 3. Influence of learning rate on network

From the Fig. 3, it can be seen that when the learning rate reaches 0.6, the lowest RMSECV was obtained. The CA rate for leave-one-out cross validation and test set are 100% and 94.74%, respectively.

The influence of momentum Adding a momentum term can help to escape a small local minimum in the error surface and speed up the convergence. But if momentum is too large the BPNN will

Fig. 5. Influence of learning epochs on network

lead to small oscillations rather than divergence. From the optimization process shown in Fig. 4, it can be seen that

There was a wrong classification sample with number

when the momentum is 0.5, the lowest RMSECV and the

index 48 in test set. Sample 48 belongs to hyperplasia class.

highest CA rate was obtained.

Malinowski Distances between sample 48 and the center of malignant and hyperplasia class were investigated. The results showed that sample 48 was further from the center of hyperplasia than that of malignant. Therefore, sample 48 maybe an outlier, although further proof is still needed.

4. Conclusion The BP neural network with 6 input, 5 hidden, and 3 out neurons was built to diagnose the endometrial cancer based on the NIR spectra of endometrial tissue section. The

Fig. 4. Influence of momentum on network

classification

accuracy

rate

of

leave-one-out

cross

validation and test set were 100% and 94.7%, respectively.

The influence of learning epochs

The results indicated that neural network coupling with NIR spectroscopy of tissue can provide a new method for the

511

early diagnosis of endometrial cancer. Further work of this

York, 2005, 4(5): 497-512.

aspect is under investigation.

[10] J.H. Ali, W.B. Wang, M. Zevallos, and R.R. Alfano, Near infrared spectroscopy and imaging to probe differences in water content in normal and cancer human prostate tissues, Technol.

Acknowledgment

Cancer Res. Treat., Adenine press, New York, 2004, 3 (5): 491-497.

This work was supported by the Natural Science

[11] M.H. Rhiel, M.B. Cohen, M.A. Arnold, and D.W.

Foundation of China (20875065 and 30772322).

Murhammer, On-Line Monitoring of Human Prostate Cancer Cells

Reference:

in

[1] http://www.cancer.gov/cancertopics/types/endometrial.

Jersey, 2004, 86(7): 852-861.

[2] S.L.Parker, T.Tong, S.Bolden, and P.A.Wingo, Cancer statistics,

[12] V.R. Kondepati, J. Zimmermann, M. Keese, J. Sturm, B.C.

1996, CA Cancer J. Clin., Amer Cancer Soc, New York, 1996,

Manegold, and J. Backhaus, Near-infrared fiber optic spectroscopy

46:5-27.

as a novel diagnostic tool for the detection of pancreatic cancer, J.

[3] T.P. Canavan, and N.R. Doshi, endometrial cancer, Am.. Fam.

Biomed. Opt. Spie-int society optical engineering, Washington,

Physician, Amer ACAD family physicians, Kansas, 1999, 59(11):

2005, 10 (5): 054016(1-6).

3069-3077.

[13] H. Cen, Y. Bao, and Y. He, Pattern recognition of visible and

[4] V.R. Kondepati, M. Keese, R. Mueller, J. Backhaus, and B.C.

near-infrared spectroscopy from bayberry juice by use of partial

Manegold, Application of near-infrared spectroscopy for the

least squares and a backpropagation neural network, Appl Opt.,

diagnosis of colorectal cancer in resected human tissue specimens,

Optical soc Amer, Washington, 2006, 45(29): 7679-7683.

Vib. Spectrosc., Elsevier, Netherlands, 2007, 44, 236–242.

[14] F.X. Wang , Z.Y. Zhang, X.J. Cui, and P.B. Harrington,

[5] V.R. Kondepati, H.M. Heise, and J. Backhaus, Recent

Identification of rhubarbs by using NIR spectrometry and

applications of near-infrared spectroscopy in cancer diagnosis and

temperature-constrained cascade correlation networks, Talanta,

therapy,

Elsevier, Amsterdam, 2006, 70, 1170–1176.

Anal.

a

Perfusion

Rotating

Wall

Vessel

by

Near-Infrared

Spectroscopy, Biotechnol. bioeng., John Wiley & Sons INC, New

Bioanal.

Chem.,

Springer

Heidelberg,

Germany.,2008, 390:125–139.

[15] D. Perez-Marin, A. Garrido-Varo, J.E Guerrero, J.C.

[6] S.H. Chung, A.E. Cerussi, C. Klifa, H.M. Baek, O. Birgul, G.

Gutiérrez-Estrada,

Gulsen, S.I. Merritt, D. Hsiang, and B.J. Tromberg, In vivo water

near-infrared reflectance spectroscopy calibrations for predicting

state measurements in breast cancer using broadband diffuse

the inclusion percentages of wheat and sunflower meal in

optical spectroscopy, Phys. Med. Biol., Iop publishing LTD,

compound

Bristol, 2008, 53(23): 6713-6727.

spectroscopy, Moldova, 2006, 60(9): 1062-1069.

[7] S. Prince, and S. Malarvizhi, Monte Carlo simulation of NIR

[16] Y. Dou, T. Zou, T. Liu, N. Qu, and Y. Ren, Calibration in

diffuse reflectance in the normal and diseased human breast tissues,

non-linear NIR spectroscopy using principal component artificial

Biofactors, Ios press, Amsterdam, 2007, 30(4): 255-263.

neural networks, Spectrochim Acta A Mol. Biomol. Spectrosc.,

[8] N. Shah , A. Cerussi, C. Eker, J. Espinoza, J. Butler, J. Fishkin,

Pergamon-Elsevier science LTD, Oxford, 2007, 68(5): 1201-1206.

R. Hornung, and B. Tromberg, Noninvasive functional optical

[17] X.R. Zhu , N. Li, X.Y. Shi, Y.J. Qiao, and

spectroscopy of human breast tissue, Proc. Natl. Acad. Sci, Natl

Analysis of geniposide and total nitrogen content in Qingkailing

ACAD sciences, Washington, 2001, 98 (8): 4420-4425.

injection intermediate by near infrared spectroscopy and synergy

[9] S. Nioka, and B. Chance, NIR Spectroscopic Detection of

interval partial least squares, Chem. J. Chinese. U., Higher

Breast Cancer, Technol. Cancer Res. Treat., Adenine press, New

education press, Beijing, 2008, 29(5): 906-911

512

Use

feedingstuffs,

of

artificial

Appl.

neural

Spectrosc.,

networks

Soc

in

applied

Z.Y. Zhang,