%0 Journal Article %A Chulian García, Salvador %A Stolz, Bernadette J. %A Martínez Rubio, Álvaro %A Blázquez, Cristina %A Rodríguez‑Gutiérrez, Juan Francisco %A Caballero-Velázquez, Teresa %A Molinos Quintana, Águeda %A Ramírez-Orellana, Manuel %A Castillo Robleda, Ana %A Fuster Soler, Luis %A Minguela Puras, Alfredo %A Martínez-Sánchez, María V. %A Rosa Durán, María %A Pérez-García, Víctor M. %A Byrne, Helen M. %T The shape of cancer relapse: Topological data analysis predicts recurrence in paediatric acute lymphoblastic leukaemia %D 2023 %U http://hdl.handle.net/10498/30917 %X Although children and adolescents with acute lymphoblastic leukaemia (ALL) have high sur- vival rates, approximately 15-20% of patients relapse. Risk of relapse is routinely estimated at diagnosis by biological factors, including flow cytometry data. This high-dimensional data is typically manually assessed by projecting it onto a subset of biomarkers. Cell density and “empty spaces” in 2D projections of the data, i.e. regions devoid of cells, are then used for qualitative assessment. Here, we use topological data analysis (TDA), which quantifies shapes, including empty spaces, in data, to analyse pre-treatment ALL datasets with known patient outcomes. We combine these fully unsupervised analyses with Machine Learning (ML) to identify significant shape characteristics and demonstrate that they accurately pre- dict risk of relapse, particularly for patients previously classified as ‘low risk’. We indepen- dently confirm the predictive power of CD10, CD20, CD38, and CD45 as biomarkers for ALL diagnosis. Based on our analyses, we propose three increasingly detailed prognostic pipe- lines for analysing flow cytometry data from ALL patients depending on technical and tech- nological availability: 1. Visual inspection of specific biological features in biparametric projections of the data; 2. Computation of quantitative topological descriptors of such projec- tions; 3. A combined analysis, using TDA and ML, in the four-parameter space defined by CD10, CD20, CD38 and CD45. Our analyses readily extend to other haematological malignancies. %~ Universidad de Cádiz