Organizers: Olga Vitek and Michael Shortreed

Description

This workshop brings together developers (and aspiring developers) of computational and statistical tools for mass spectrometry, proteomics and metabolomics. The program will include tutorials by leading developers of tools, invited oral presentations, oral and poster presentations selected from submitted abstracts, a panel discussion, as well as ample opportunities for informal interactions..

We invite everyone to submit abstracts describing tools for mass spectrometry, proteomics and metabolomics. Of particular interest are topics such as treatment of match-between-runs, normalization of multi-batch experiments, and treatment of missing values, however all topics are welcome. The abstracts should focus less on the specific functionality, and more on the design of the tool, and its strategy for enabling reproducible research, sustainability, and inter-operability with other computational tools.

A subset of the abstracts will be selected for oral or poster presentations. All accepted presenters will receive a free admission to this part of the program.

Target audience

Anyone interested in developing computational and statistical tools for mass spectrometry, proteomics and metabolomics.

Meeting Agenda

Below is the draft of the 2025 meeting agenda. More updates and additions will be posted here soon.

Location: Northeastern University Main Campus, Boston MA (Room TBA)

Saturday, May 10

May 10, 9:00am – 12:30pm Tutorial: Knowledge-driven reconstruction of context-specific molecular networks using proteomics.
This session combines theoretical insights and practical approaches to context-specific biological network inference, leveraging prior knowledge from OmniPath. We will explore the foundational concepts and methodologies, including the various levels of information provided by prior knowledge, the trade-off between coverage and confidence, and both basic and advanced techniques for single- and multi-condition analyses using tools from NetworkCommons and CORNETO. The session will also feature an open discussion on the major limitations in the field and the strategies being developed to address these challenges. For the practical part of the session, attendees should have a basic understanding of Python programming and a Google account to run notebooks in Google Colab.
Speaker: Martin Garrido
May 10, 12:30pm – 1:30pm Lunch break and posters
May 10, 1:30pm – 5:00pm Tutorial: Challenges in scalable computing for biological data.
Languages like R and Python provide an accessible introduction to programming for analyzing biological data through communities like Bioconductor and Biopython. However, scaling workflows to analyze massive datasets efficiently can be very difficult and often requires significant programming expertise. We will discuss common challenges in parallel processing and distributed computing for biological datasets, current techniques for dealing with these challenges, and future directions the community can take to improve the state of the art. Participants should be comfortable programming in R or Python with an openness to learning concepts from other languages
Speaker: Kylie Bemis
May 10, 6:00pm – 9:00pm Future developers meeting dinner and scientific panel
Location: TBA, all participants are invited

Sunday, May 11

Additional speakers TBA

May 11, 9:00am – 10:30am Tutorial: Missingness-informed protein quantification and differential expression (DE) analysis powered by limma and limpa.
In this lecture, we will introduce the suite of limpa and limma methods for differential expression (DE) analysis in mass spectrometry (MS)-based proteomics data. The new pipelines are based on the detection probability curve (DPC; Li and Smyth, 2023), which is a probabilistic model for non-ignorable missingness in MS-based data. DPC-based protein quantification takes missing values into account which significantly improves the DE analysis. We will also have a brief discussion on the best practices for exploratory data analysis, batch correction and normalization of MS data. Attendees should have basic to intermediate R coding skills.
Speaker: Mengbo Li
May 11, 10:30am – 11:00am Break and refreshments
May 11, 11:00am – 12:30pm Invited and contributed talks
Invited speaker: Brian Searle, Ohio State University, Title TBA
Additional speakers TBA
May 11, 12:30pm – 1:30pm Lunch break and posters
May 11, 1:30pm – Invited and contributed talks – Open-ended
Additional speakers TBA

 

Practical Details

Application

You may apply to attend the Future Developers Meeting via the May Institute application form. Deadlines are the same as for May Institute.

Talk/Poster Requirements

Must be your original research. Talks should be 15 minutes or less. Posters size roughly A0. Please include a max 1 page abstract with your application, or email it to mayinstitute@ccs.neu.edu by the May Institute application deadline.