This thesis focuses on modernizing bioinformatics workflows, which are often hindered by outdated tools and manual processes, to improve efficiency and automation. It presents a general architecture designed to support a wide range of bioinformatics applications, deployable within a High-Performance Computing (HPC) environment, with a user-friendly interface. The system provides key services such as authentication, data storage, and job scheduling, allowing users to interact with the platform effortlessly while abstracting away computational complexities. The architecture was implemented and tested by developing a workflow for the HashClone suite, a tool to detect the major clones and to follow them in longitudinal samples. The workflow offers clonal population analytics, specialized charts for visualizing relevant clones and trends, and IMGT/V-QUEST analysis for cross-referencing data, enabling researchers to visually analyze data. The system revolves around a user workspace where datasets and computed outputs are stored, ensuring accessibility and reproducibility. This project provides a scalable foundation for future bioinformatics applications, improving usability by bridging the gap between non-technical users and complex computational processes, enabling researchers to leverage computational power. By offering accessible, up-to-date tools, it paves the way for advancing research and enhancing the potential of researchers in the field.
This thesis focuses on modernizing bioinformatics workflows, which are often hindered by outdated tools and manual processes, to improve efficiency and automation. It presents a general architecture designed to support a wide range of bioinformatics applications, deployable within a High-Performance Computing (HPC) environment, with a user-friendly interface. The system provides key services such as authentication, data storage, and job scheduling, allowing users to interact with the platform effortlessly while abstracting away computational complexities. The architecture was implemented and tested by developing a workflow for the HashClone suite, a tool to detect the major clones and to follow them in longitudinal samples. The workflow offers clonal population analytics, specialized charts for visualizing relevant clones and trends, and IMGT/V-QUEST analysis for cross-referencing data, enabling researchers to visually analyze data. The system revolves around a user workspace where datasets and computed outputs are stored, ensuring accessibility and reproducibility. This project provides a scalable foundation for future bioinformatics applications, improving usability by bridging the gap between non-technical users and complex computational processes, enabling researchers to leverage computational power. By offering accessible, up-to-date tools, it paves the way for advancing research and enhancing the potential of researchers in the field.
Potenziamento dell'Analisi e dell'Interpretazione dei Dati sui Linfomi con High Performance Computing: Archiviazione, Computazione, Visualizzazione
PEROTTINO, ELISA LI
2023/2024
Abstract
This thesis focuses on modernizing bioinformatics workflows, which are often hindered by outdated tools and manual processes, to improve efficiency and automation. It presents a general architecture designed to support a wide range of bioinformatics applications, deployable within a High-Performance Computing (HPC) environment, with a user-friendly interface. The system provides key services such as authentication, data storage, and job scheduling, allowing users to interact with the platform effortlessly while abstracting away computational complexities. The architecture was implemented and tested by developing a workflow for the HashClone suite, a tool to detect the major clones and to follow them in longitudinal samples. The workflow offers clonal population analytics, specialized charts for visualizing relevant clones and trends, and IMGT/V-QUEST analysis for cross-referencing data, enabling researchers to visually analyze data. The system revolves around a user workspace where datasets and computed outputs are stored, ensuring accessibility and reproducibility. This project provides a scalable foundation for future bioinformatics applications, improving usability by bridging the gap between non-technical users and complex computational processes, enabling researchers to leverage computational power. By offering accessible, up-to-date tools, it paves the way for advancing research and enhancing the potential of researchers in the field.File | Dimensione | Formato | |
---|---|---|---|
ElisaLiPerottino_861222.pdf
non disponibili
Dimensione
8.57 MB
Formato
Adobe PDF
|
8.57 MB | Adobe PDF |
I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/20.500.14240/164332