O n l i n e     P u b l i c a t i o n M e d i c a l   B i o i n f o r m a t i c s
Proposal http://werner.yellowcouch.org/postdocbergen/index.html

Analysis, Normalization & Estimation Algorithms for the Stratification of AML/ALL Cancer Patients

Werner Van Belle; 19 October 2006
Contact: werner@sigtrans.org, werner.van.belle@gmail.com

Abstract: This is a postdoctoral proposal sent to the University in Bergen. This project aims to apply signal processing techniques to analyze, normalize and build models for cancer proteomics. Through linking biomedical parameters (such as survival rate) to specific protein samples (from patients), better therapy might come into reach. The project parallelly investigates data handling procedures for mass spectrometry (MALDI TOF), 2D electrophoretic gels and gene array data. The built models will be validated as estimators in a cancer research setting.

Refering: WVB06] Werner Van Belle; Analysis, Normalization & Estimation Algorithms for the Stratification of AML/ALL Cancer Patients. Postdoctoral Proposal submitted to the University of Bergen. October 2006

Keywords: patient stratification, AML/ALL PAtients, denoising, normalization

Disclaimer: This is a proposal written by the beforementioned partners. Copying part of it without giving due credit must be considered plagiarism. If you are interested in this research consider contacting one of the partners to discuss the presented research.

1 Research Context

Recently a cooperation with Haukeland University Hospital led to experiments in relation finding on p53Ab stained immunoblots from 73 acute myeloid leukemia (AML) patients and acute lymphoblastic leukemia (ALL) patients. These experiments revealed various correlations (ranging from -0.304 to 0.458 using the stringent Spearman rank order coefficient) between the patient survival rate and the patient immunoblot (taken before treatment).

1.1 Leukemia

Figure 1: After proper alignment and calibration of 2DE immunoblots of clinically tested AML and ALL patients, the correlation analysis determines relations between the immunoblots and the clinical parameter. Green color in the correlation image indicate correlation, brown indicates anti-correlation.
Image ProcessFlow

AML is a rapidly developing malignancy of the non-lymphoid bone marrow, characterized by a block in differentiation and proliferation of immature cells, which suppresses normal bone marrow function [LDB99]. Prognosis of the AML patient is determined based on cytogenetic aberrations [GWO+98], morphological disease free bone marrow after first chemotherapy [WAG99] and mutational status for the receptor tyrosine kinase Flt3 [KGF01]. However, about 50% of the patients have no reliable marker for prognosis, in where gene expression analysis may add prognostic information about these patients [VVB+04,BDB+04]. The current hypothesis of leukemogenesis in AML involves one mutational hit in a signal transduction system and one hit in a transcription factor [GCF04], and gene expression analysis may reflect hits in thus far unknown signaling networks. We have recently suggested that signaling through phosphoprotein networks elucidate the underlying response mechanisms for chemotherapy, and may mirror the risk stratification in AML [IHK+04]. A question that rose was whether it was possible to decode this information at a complex junction in cancer cell fate determination after chemotherapy; at the post-translational modifications level of the p53 protein.

1.2 Relation Finding

Figure 2: Correlation plot of p53 isoforms related to survival rate in treated AML patients. Left: flat image; Right: 3D plot
Image PosCors

We visualized p53 protein isoform distribution in purified AML cells for patients using two-dimensional gel electrophoresis with immunoblot analysis (2DI) of p53. This method determined the unique p53 pattern of 39 AML cases and 8 ALL cases. After alignment and normalization of 2DIs we performed a correlation analysis with the clinical parameters overall survival, response after first course of chemotherapy [WAG99] and finally the Flt3 mutational status [KGF01]. We observed significant consistencies in p53 pattern for these parameters. The method used to measure the correlation and their significance is given in figure 1.

Fig. 2 shows the measured correlations. At least 16 interesting spots can be found: a) 3 correlating p53-$\delta$ isoforms, b) 4 anti-correlating spots in the low pI p53-$\alpha$ tail, c) approximately 3 anti-correlating spots in the hi-pI p53-$\alpha$ tail, d) 3 anti-correlating spots below the p53-$\delta$ region, e) a strong anti-correlating p35 (putative short form of p53) and f) 4 anti-correlating spots in the hi-pI p53-$\delta$ tail. The particular presented 3D plots reveal correlations between immunoblots and survival rate. We performed further experiments to relate other parameters, such as AML FAB classification, patient remission, AML/ALL classification and Flt3 WT vs ITD mutations. They all revealed similarly important results that could not be obtained using standard analysis techniques (relying on spot detection, or with fewer 2DE gels).

1.3 Prediction

The many correlations we find in the p53Ab stained immunoblots leads us to believe that the presented technique can be further developed to predict patient outcome. The challenge we face with the current data is that the single individual correlations, despite being relatively strong, are still too weak to be used as a prediction tool. E.g: one cannot simply measure one p53-$\delta$ spot to predict survival rate. Instead, we believe that a suitable combination of all relevant spots will lead to a much better predictor.

2 Project Plan

The proposed project first investigates calibration and normalization between various data acquisition techniques, including 2D electrophoretic gel imaging, MALDI-TOF mass spectrometry and micro-arrays. Afterward, classification and prediction techniques will be developed in the context of patient stratification. The following paragraphs each represent a specific task that we will conduct during the postdoctoral period. englishAll the 2D Gel images in this proposal are either produced by Nina Ånensen or by Gry Sjøholt. The mass spectra are obtained in coopeartion with Olav Mjaavatten.

2.1 Denoising of Mass Spectrograms

At the University of Bergen (the Proteomics Unit), we performed a number of measurements on the Ultraflex II and Autoflex machines (MALDI TOF and MALDI TOF-TOF). The output of these machines contains a number of artifacts, which are also present on machines located at other sites, such as the Flemish Biotechnology Center and freely published online spectra. The investigation of the spectral output of the machine is based on a sliding windows Fourier transform (SFFT). In our calibration tests we found a) linear tones, b) decaying tones and c) coherent noise.

Figure 3: Artifacts in a typical mass spectrometry. Left: in reflection mode we see static and up-sweeping tones. Right: in 'lift' we see decaying tones.
Image GlobalNoiseSignalFixed2
Image LiftNoise

englishFigure 3a shows 3 static tones superimposed over the signal (these are the three horizontal lines), as well as 3 linear upward sweeping tones (the three slightly upward slanted lines) and a burst of noise shortly after the deflection mode of the machine.

In a second experiment (figure 3b) we measured the lift of a peak using a MALDI-TOF/TOF machine. The mixture contained a protein-fragment which was to be sequenced. The frequency analysis shows static tones at a low frequency and decaying tones which start at a high frequency and decay exponentially. Also, these LIFT experiments have often a very low signal/noise level. Even so that an expert is needed to select the correct peaks for further analysis.american

Figure 4: Noise coherence in linear mode on an empty plate. Depending on the number of shots, the noise level increases at specific positions.
Image Noise1
Image Noise100
Image Noise10

In a third experiment we measured the pure noise output of a MALDI machine in linear mode. During the experiment, the laser was switched off, as such we measured only the noise generated by the machine. In this experiment we found noise coherence. Depending on the number of shots performed, the noise fingerprint will be different, and the more shots are performed the lower the signal/noise ratio becomes. Figure 4 shows the noise fingerprint after a different number of shots (1, 10, 100 and 1000). One shot gives rise to a strong pulse at a certain position. The location of this pulse is random however there are positions which have a higher chance of producing such a phantom peak. So, depending on the number of shots performed we will see different phantom peaks. If we perform few shots the phantom peaks will be strong, if there are many shots, the noise will overrun the intended measurement. american

Many of these artifacts can be removed and our preliminary results already led to a number of algorithms to do so [BM05]. Nevertheless, more validation is necessary to publish these results. Especially, validation of known standards, comparison of the output of peak detectors on both denoised and original mass spectra. To achieve this we will cooperate closely with PROBE and the VIB (University of Ghent; Belgium).


2.2 english2D Gel Alignment & Automatic Isoform Detection

Figure 5: Progressive multi image alignment retains features of the original images but goes fast. Multi image alignment based on similarity matrices is much more accurate but takes a long time (days) to compute. Once the process completes important isoforms can be readily observed.

Image RoiP63
Gel 1
Gel 2
Automatic isoform detecion

Image SummedOverlayMotherAA2

Image SummedOverlayMotherSOS_2 Image RoiOverlayGammaCorrected
Aligned according to gel 1
Aligned according to gel 2
Multi aligned image stack

Correlation and analysis of multiple 2DE gels relies properly aligned images. Standard alignment algorithms typically work between any two 2DE gels [WF05,AKY+03,IAH95,DI94,Cur04,CP92]. Unexpectedly, there seems to be no good straight forward extension towards multiple images. Neither of the trivial extensions, among which dynamic alignment (align one image on top of the previous image) and progressive alignment (align a new image on the projected sum of all previous images) produce good results. They tend to strengthen features found in the initial image (Fig. 5).

We experimented with an algorithm that positions all images such that the distance between every image-pair is as close as possible to the measured translation of that pair. The results are promising because all the expected isoforms could be automatically detected in the projected image. Nevertheless, the long computation time (3 days for 90 images using IDL [RSIR] on a standard PC) prompts us to investigate the possibilities optimized search algorithms [Bel06]. We plan to publish these results.


2.3 Gel Field Warping

Figure 6: Different artifacts commonly found in 2D gels. Left) lens warping; Right) crumpling which leads to field distortions.
Image WarpedGridImage CrumplingBounded

It is well know that 2D gels tend to bend in the middle area. This together with the standard handling procedures (washing and possible drying) lead to spots that shift around the surface in an inconsistent manner. Aside from the gel handling procedures, Lens effects might be responsible for spot shifts as well (both demonstrated in figure american6).

We will investigate the creation of field warps to account for all of these effects. Combined with the multi-image alignment algorithms from §2.2, this should lead to more reliable mappings between various samples. Tentatively, our metric will minimize the first differential of the displacement map; as such thin plate splines with local landmark detection will be investigated [DB03,Elo,Boo89,Duc76,Mei79,Wah90,Bel00,WAB+04,WF05].


2.4 Hardware Biases

Figure 7: Two different machines imaging the same 2D Gel. Left) the Typhoon image; Middle) the Kodak image. Right) Similar biases can be found in gene-arrays.
Image 2Machines Image gene-array-bias

Figure 7 shows one gel captured by two different cameras. An important observation here is that the relative grey value levels are not the same on the two machines. Area A shows three clearly differentiated spots for the Typhoon imager, while this area is a blur in the Kodak imager due to a too strong signal level (Kodak A > Typhoon A). On the other hand, the Typhoon imager observes darker grey values in area B than the Kodak imager (Kodak B < Typhoon B). This is a clear contradiction of which the cause lies in the different imaging approaches used by the two cameras. This indicates that normalizations of 2D gels, which is crucial for a scientific quantitative protein analysis, remains a challenging task [LK03]. englishA similar problem might arise in gene-array data where, depending on the hardware lens effects might occur as well (Figure 7).

We will establish standard procedures for data normalization based on specific machine measurements (base line acquisition). This will lead to data data can be compared between different machines. In turn this leads to better quantifiable experiments and it will also be necessary for further correlation tests (§2.5) and gel-array correlations (§2.7).


2.5 Intra Gel Correlation / Feature Vector Correlation

In 2005, we developed a technique to measure correlations between 2D gels and biomedical variables [ÅIHDBG06,BAH+06]. This technique can be further refined taking into account baselines and new alignment algorithms as described in §2.2, §2.3 and §2.4. The analysis technique will also be extended with novel significance measures based on canonical correlations, copula testes and linearity tests. Further spectral analysis will be introduced to measure oscillatory phenomena in multiple 2D images. These initial improvements will be integrated into the GelSignal software (http://www.sigtrans.org/) and might lead to a new publication based on the software technology.

After that, more thorough improvements will become possible. A useful technique to analyze behavior in 2DE gels are intra-gel correlations. This technique compares a specific position towards every other position in the gel and reports how similar their behavior is, irrelevant of the external biomedical variable. Such analysis will helps biomedical sciences to find and identify spots with the same behavior. Extending this further makes it possible to link every position in the gel towards the same external feature vector. This will be important for §2.7 and §2.8.

2.6 Dimensionality reduction

Dimensionality reduction is an important tool in both visualization and actual analysis of results. We will rely on singular value decomposition: PCA (Principal component analysis) and PCR (principal component regression) [C.M95] is commonly used to reduce the dimensionality of multidimensional data. Specific to our research is the possibility to consider every gel as a point in an $x\times y$ dimensional space with $x$ and $y$ the respective horizontal and vertical size of the images. Reducing this large dimensional space ($10^{6}$ dimensions at a common size of $1024\times1024$) to a number of principal axis should allow us to pseudo-color gels by assigning colors to the principal axis. Other techniques such as image segmentation [SC95,CS96] will be investigated as well. Both make it possible to compress the gel content for further analysis in a predictive context (§2.8). The same techniques are applicable to micro array data as well (§2.7).


2.7 Multi-variate Gene Array Correlation

An important possibility offered by the correlation tests we developed is its extension to link gene-array data to 2D gels. Such 4 dimensional maps (one correlation picture for every micro-array dot) can be best visualized by first reducing the dimensionality of the associated micro-array information. (E.g: dots that behave the same can be classified in their own cluster and lead to a feature vector to which the gel images are correlated.

We will test the current available micro data [ØBJ+06,ØBJ05] towards the various patients by means of the before mentioned dimensionality reduction techniques and the new correlation capabilities as developed in §2.5.


2.8 Class Prediction

Automatic classification can both be used as an exploratory tool and a prediction tool. Unsupervised learning techniques will help detecting natural classes while supervised methods will weigh different attributes based on their relation to the associated biomedical parameter.

Unsupervised classification will be provided as a form of exploratory and data understanding tool. The kind of classification we will provide is Bayesian classification as implemented in Autoclass [SC95,CS96]. If this turns out too slow, we will rely on the data reduction techniques developed earlier (PCA/PCR analysis). Our expertise in this area will be augmented with multivariate clustering algorithms developed by the applied economics research lab in Leuven, Belgium [CJ05,CRG05,SCFE0x]. Pseudo-coloring will be conducted along the following lines. Every class will be associated with a color. The influence of every attribute (pixel) onto that specific class determines the presence of the specific color. Supervised classification will be explored together with the bioinformatics platform in Bergen.


2.9 Estimation using Partial Least Squares english(PLS)

Partial least squares (PLS) is a linear regression technique that has been successfully applied to chemometrics problems. Typically, chemometric problems have small training data sets, but high dimensionality, for instance a small number of $N$-point spectra, with corresponding known constituent concentration data. The number of different constituents $P$ contributing to each spectrum is necessarily much smaller than $N$.

PLS is closely related to PCR, but take advantage of response information in the construction of component vectors. Low rank projections of both the data matrix and the response matrix are calculated, and the component vectors are found by minimizing a quadratic cost function depending on both the low rank data matrix and the low rank response matrix. This is the main difference between PCR and PLS. In the special case of one-dimensional response data (which is already low rank), the least squares regression corresponds to maximizing the covariance between the remaining lowrank data and the response data vector sequentially until sufficient covariance is accounted for.

The main advantage of PLS over PCR is obviously that the component vectors are directly related to the response data of interest, rather than ordered only after how much of the variability in the data set is accounted for. This means that the component vectors are much more likely to be physically meaningful. In chemometric applications, the component vectors are usually more similar to the known constituent spectra than the principal components of PCR. In some cases, application of the PLS method has even resulted in significant spectral residues, attributed to contaminants, that were not present in the original calibration mixtures. Thus, the PLS method potentially has high predictive potential. american

2.10 2DE Gel / Biomedical Prediction

Prediction of gel images or associated parameters based on PLQ and Bayesian Estimation as developed in (§2.8 and §2.9) will be implemented into the SigTrans GelSignal software. In the first scenario, a 'prototype gel' will be rendered for a specific biomedical parameter. This can help the researchers to find spots that behave unnaturally. In the second scenario the biomedical parameter will be estimated based on the gel images. This will make it possible to use the algorithms in a patient stratification and outcome prediction context. The validation of the estimators will be performed using jackknife testing.

3 Applicability

The advantages and applicability of the presented research are manifold. Denoising and normalization of inter-instrument data will lead to

    1. more accurate interpretation of data. In the case of MALDI TOF denoising, we can remove the $\frac{m}{z}$ shift introduced through the noise. For 2D gels, positioning of peaks can be performed better if background artifacts have been removed.
    2. automatic analysis of data. By removing unwanted artifacts, which are normally ignored by the expert, we can speed up the analysis process and avoid human errors. This is especially crucial in high throughput proteomics, which is feasible only if the data can be analyzed and assessed automatically.
    3. quantitative approaches. Normalization of data over different machines is important if we are interested in quantitative data (as opposed to qualitative data). E.g; measuring up and down regulation of post-translational modifications using 2D gels or measuring protein content changes using MALDI and diagonal chromatography methods. These techniques are only scientific valid if the interpretation of an experiment is independent of the actual hardware involved. We already indicated (figure 7) how challenging this can be for even the most basic technique: 2D Gels. Clearly there is an urgent need for a better understanding of the different machines and techniques involved. The best way to create such an understanding is by making conversion routines and normalization routines.
    4. understanding technique related errors. An important side effect of this research is that we will be able to quantify the experimental errors related to every specific technique. Currently there is very little known in this area.
    5. software for FUDGE. The methodology we will develop will be integrated americaninto the GelSignal interactive image spreadsheet. This can be beneficial for basic and clinical research projects in multidisciplinary fields. E.g: comparison (phospho)protein patterns in unstimulated versus stimulated cells or in normal cells versus cancer cells. Appropriate data handling facilitates the detection of alternatively expressed (or phosphorylated) proteins. Also protein patterns in normal tissue versus diseased tissue (E.g: normal brain tissue vs Huntington's disease tissue) or non infected versus virus-infected cells can be analyzed. The specific knowledge transfer we propose and the public availability of the software might be very valuable for FUGE
    6. patient stratification in a cancer setting, which might lead to better and individualized therapy. The methods developed in this research might be applicable to other diseases as well.

Bibliography

AKY+03 - Roy Anindya, R. Lee Kwan, Hang Yaming, Mark Marten, Raman Babu. Analyzing two-dimensional gel images. Raport instytutowy, Department of Mathematics and Statistics, University of Maryland, August 2003.

BAH+06 - Werner Van Belle, Nina Anensen, Ingvild Haaland, Øystein Bruserud, Kjell-Arild Høgda, Bjørn Tore Gjertsen. Correlation analysis of two-dimensional electrophoretic protein patterns and biological variables. BMC Bioinformatics, 7(198), April 2006. http://www.biomedcentral.com/1471-2105/7/198.

BDB+04 - L. Bullinger, K. Dohner, E. Beir, S. Frohling, R.F. Schlenk, R. Tibshirani, H. Dohner, J.R. Pollack. Use of gene-expression profiling to identify prognostic subclasses in adult acute myeloid leukemia. N Engl J Med, 16(350):1605-16, April 15 2004.

Bel00 - Serge Belongie. Thin plate spline. Raport instytutowy, From MathWorld-A Wolfram Web Resource, 2000. http://mathworld.wolfram.com/ThinPlateSpline.html.

Bel06 - Werner Van Belle. Correlation between the sum of the inproduct and the sum of absolute differences is -0.8485 for uniform sampled signals on [-1:1]. Submitted to Transactions on Pattern Analysis and Machine Intelligence, 2006.

BM05 - Werner Van Belle, Olav Mjaavatten. Denoising of maldi tof mass spectra. Proceedings of the VIIth European Symposium of the Protein Society (AOPC2005), 2005.

Boo89 - F.L. Bookstein. Principal warps: Thin plate splines and the decomposition of deformations. IEEE Trans. Pattern Anal. Mach. Intell, 11:567-585, 1989.

CJ05 - Christophe Croux, Kristel Joossens. Influence of observations on the misclasification probability in quadratic discriminant analysis. Journal of Multivariate Analysis, 2005.

C.M95 - C.M.Bishop. Neural Networks for Pattern Recognition. Oxford University Press, 1995.

CP92- K. Conradsen, J. Pedersen. Analysis of two-dimensional electrophoresis gels. Biometrics, 48:1273-1287, 1992.

CRG05- C. Croux, A. Ruiz-Gazen. High breakdown estimators for principal components: the projection-pursuit approach revisited. Journal of Multivariate Analysis, 95:206-226, 2005.

CS96- P. Cheeseman, J. Stutz. Advances in Knowledge Discovery and Data Mining, rozdzia/l Bayesian Classification (AutoClass): Theory and Results. AAAI Press/MIT Press, 1996.

Cur04- S. Curch. Advances in two-dimensional gel matching technology. Biochem Soc Trans, 32(Pt3):511-516, June 2004.

DB03- Gianluca Donato, Serge Belongie. Approximation methods for thin plate spline mappings and principal warps. Raport instytutowy CS2003-0764, UCSD Departmebnt of Computer Science and Engineering, September 2003. http://www.cs.ucsd.edu/Dienst/UI/2.0/Describe/ncstrl.ucsd_cse/CS2003-0764.

DI94- Donoho D.L, I.M.Johnstone. Ideal spatial adaptation by wavelet shrinkage. Biometrika, 81:425-455, 1994.

Duc76- J. Duchon. Interpolation des fonctions de deux variables suivant le principe de la flexion des plaques minces. RAIRO Analyse Numérique, 10:5-12, 1976.

Elo- Jarno Elonen. Thin plate spline editor - an example program in c++. Raport instytutowy URN:NBN:fi-fe20031422. http://elonen.iki.fi/code/tpsdemo/index.html.

GCF04- D.G. Gilliland, C.T. Jordan CT, C.A. Felix. The molecular basis of leukemia. Hematology (Am. Soc. Hematol. Educ. Program), strony 80-97, 2004.

GWO+98- D. Grimwade, H. Walker, F. Oliver, K. Wheatley, C. Harrison, J. Rees, I. Hann, R. Stevens, A. Burnett, A. Goldstone. The importance of diagnostic cytogenetics on outcome in aml: analysis of 1612 patients entered into the mrc aml 10 trial. The Medical Research Council Adult and Children's Leukaemia Working Parties; Blood, 92:2322-33, 1998.

IAH95- M. Irani, P. Anandan, S. Hsu. Mosaic based representation of video sequences and their applications. Proceedings of the 5th ICCV, 1995.

IHK+04- J.M. Irish, R. Hovland, P.O. Krutzik, O.D. Perez, Ø. Bruserud, B.T. Gjertsen, G.P. Nolan. Single cell profiling of potentiated phospho-protein networks in cancer cells. Cell, 118:217-228, 2004.

KGF01- P.D. Kottaridis, R.E. Gale, M.E. Frew. The presence of a flt3 internal tandem duplication in patients with acute myeloid leukemia (aml) adds important prognostic information to cytogenetic risk group and response to the first cycle of chemotherapy: analysis of 854 patients from the united kingdom medical research council aml10 and 12 trials. Blood., 98:1752-1759, 2001.

LDB99- B. Lowenberg, J.R. Downing, A. Burnett. Acute myeloid leukemia. N Engl J Med, 341(1051), 1999.

LK03- Choe LH, Lee KH. Quantitative and qualitative measure of intralaboratory two-dimensional protein gel reproducibility and the effects of sample preparation, sample load and image analysis. Electrophoresis, 24(19-20):3500-3507, October 2003.

Mei79- J. Meinguet. Multivariate interpolation at arbitrary points made simple. J. Appl. Math. Phys., 30:292-304, 1979.

RSIR- CO Research Systems Inc (RSI), Boulder. Idl, the interactive data language, v6.1.

SC95- J. Stutz, P. Cheeseman. Maximum Entropy and Bayesian Methods, Cambridge 1994, rozdzia/l AutoClass - a Bayesian Approach to Classification. Kluwer Acedemic Publishers, Dordrecht, 1995.

SCFE0x- S. Serneels, C. Croux, P. Filzmoser, P.J. Van Espen. Partial robust m-regression. Chemometrics and Intelligent Laboratory Systems, to appear, 200x.

VVB+04- P.J. Valk, R.G. Verhaak, M.A. Beijen, C.A. Erpelinck, Barjesteh van Waalwijk van Doorn-S. Khosrovani, J.M. Boer, H.B. Beverloo, M.J. Moorhouse, P.J. van der Spek, B. Lowenberg, R. Delwel. Prognostically useful gene-expression profiles in acute myeloid leukemia. N Engl J Med, 16(350):1617-28, April 15 2004.

WAB+04- Craig Watman, David Austin, Nick Barnes, Gary Overett, Simon Thompson. Fast sum of absolute differences visual landmark detector. Proceedings of IEEE Conference on Robotics and Automation, April 2004.

WAG99- K. Wheatly, A.K. Burnett AK, A.H. Goldstone. A simple, robust, validated and highly predictive index for the determination of risk-directed therapy in acute myeloid leukemia derived from the mrc aml 10 trial. Br J Haematol., 107:69-79, 1999.

Wah90- G. Wahba. Spline Models for Observational Data. Philadelphia, PA: SIAM, 1990.

WF05- Xiuying Wang, David Dagan Feng. Hybrid registration for two-dimensional gel protein images. Third Asia Pacific Bioinformatics Conference (APBC2005), January 2005.

ÅIHDBG06- Nina Ånensenand I. Haaland, C. D'Santos, Werner Van Belle, Bjørn Tore Gjertsen. Proteomics of p53 in diagnostics and therapy of acute myeloid leukemia. Current Pharmaceutical Biotechnology, In press, 2006.

ØBJ05- A.M. Øyan, T.H. Bø, I Jonassen. Cd34 expression in native human acute myelogenous leukemia blasts: differences in cd34 membrane molecule expression are associated with different gene expression profiles. Cytometry B Clin Cytom, 64(1):18-27, Mar 2005.

ØBJ+06 - A.M. Øyan, T.H. Bø, I. Jonassen, B.T. Gjertsen, Ø. Bruserud, K.H. Kalland. cdna microarray analysis of non-selected cases of acute myeloid leukemia demonstrates distinct clustering independent of cytogenetic aberrations and consistent with morphological signs of differentiation. Int J Oncol, 28:1065-80, 2006.