Background:
The Cancer Imaging Archive (TCIA) is a highly popular resource to researchers studying radiology and histopathology images as a way to improve cancer detection and treatment assessment. It is visited by over 25,000 users each month who download hundreds of terabytes a data.
Project Description:
For this project, we are seeking students with a background in Python to help us develop a variety of different tools to improve our operations. Examples of specific use cases include:
- Validation tools to ensure standardization and uniformity of incoming data from different submitter sites
- De-identification tools to effectively remove protected health information from incoming images and clinical data while retaining the information required for downstream research
- Query tools that provide new ways for users to filter and download datasets from the site
- Data Retrieval tools to provide new ways for users to access our datasets
- Development of code to simplify the utilization of medical imaging file formats such as DICOM for use in general purpose artificial intelligence tools and Python packages (e.g. MONAI, nnUNet)
- Visualization tools to assist with understanding different metrics about how data are used
If matched, we would work with you to identify the types of problems you're most interested in working on and suggest more specific ideas for how you could contribute in these areas. Check out the additional reading materials for more background info and some existing tools that could be improved or which might inspire new ideas.
- Fall mentor time: Tuesday: 3:30 PM Eastern
- Fall lab time: Thursday: 3:30 PM Eastern
- Industry: Biotechnology
- Tools: python
- Topics: databases, visualizations
- Requirements: Open to all students