The effort involved in creating accurate ground truth segmentation maps hinders advances in machine learningapproaches to tumor delineation in clinical positron emission tomography (PET) scans. To address this challenge,we propose a fully convolutional network (FCN) model to delineate tumor volumes from PET scans automaticallywhile relying on weak annotations in the form of bounding boxes (without delineations) around tumor lesions.To achieve this, we propose a novel loss function that dynamically combines a supervised component, designed toleverage the training bounding boxes, with an unsupervised component, inspired by the Mumford-Shah piecewiseconstant level-set image segmentation model. The model is trained end-to-end with the proposed differentiableloss function and is validated on a public clinical PET dataset of head and neck tumors. Using only boundingbox annotations as supervision, the model achieves competitive results with state-of-the-art supervised and semi-automatic segmentation approaches. Our proposed approach improves the Dice similarity by approximately 30%and reduces the unsigned distance error by approximately 7 mm compared to a model trained with only boundingboxes (weak supervision). Also, after the post-processing step (morphological operations), our weak supervisionapproach differs only 7% in terms of the Dice similarity from the quality of the fully supervised model, forsegmentation task.
展开▼