National NLP Clinical Challenges (n2c2)
Announcement of Data Release and Call for Participation
2014 i2b2/UTHealth Shared-Tasks and Workshop
on Challenges in Natural Language Processing for Clinical Data
Tentative Timeline
Registration: begins March 14, 2014
Training Data Release: 1 May, 2014
Test Data Release: 1 July, 2014
System Outputs Due: 3 July, 2014
Systems Due: 1 September, 2014
Paper Submission: 1 September, 2014
Workshop: November 14, 2014
UPDATE: The 2014 Workshop Registration online form is now available. Please submit the form before November 14, 2014.
There are now two new tracks for this year's i2b2 shared tasks: one on clinical NLP software usability, and another on novel data use for the new corpus. Please click here for more information.
The 2014 i2b2/UTHealth challenge consists of two traditional NLP tracks:
The data for this task is provided by Partners HealthCare. All records have been fully de-identified and manually annotated for risk factors related to diabetes and heart disease risk factors.
Data for the challenge will be released under a Rules of Conduct and Data Use Agreement. Obtaining the data requires completing a registration, which will start March 14, 2014.
For either track, i2b2 will give the participants the option to submit their system software in addition to their system output for evaluation. Teams that submit software will be evaluated separately from teams that submit only system output. These systems will be evaluated for useability. Please see our upcoming software useability evaluation track for guidelines on expectations.
In addition, the i2b2 organizing committee is currently considering hosting a general "i2b2 Software for Clinical NLP" track that allows all past and current i2b2 challenge participants to submit and share with the community any software developed for any i2b2 shared-task.
Evaluation Dates, File Formats, and Evaluation Metrics
The evaluation for the NLP tracks will be conducted using withheld test data. Participating teams are asked to stop development as soon as they download the test data. Each team is allowed to upload (through this website) up to three system runs for each of the tasks. System output is expected in the form of standoff annotations, following the exact format of the ground truth annotations to be provided by the organizers.
Evaluation of submitted software will be evaluated by the program committee on factors such as ease of installation and ease of use.
Participants are asked to submit a short paper describing their system and analyzing their performance. Papers should be in AMIA style and should not exceed five pages. Authors of top performing systems and of particularly novel approaches will be invited to present or demo their systems at the workshop. Submitted software can be presented at the final workshop in the form of a poster or live demo.
UPDATE: The 2014 Workshop Registration online form is now available. Please submit the form before November 14, 2014.
We are pleased to announce two new tracks, added in addition to the NLP tracks 1 (de-identification) and 2 (heart disease risk factors) for the 2014 shared task and workshop.
The 2014 i2b2/UTHealth Challenge now introduces two additional tracks:
Participants are encouraged to define their own questions to ask of these data.
In addition, participants of this track can contribute visualizations on NLP output for Tracks 1 and 2, or provide novel de-identification methods. Note that this track uses data from Tracks 1 and 2 and will follow the same data release timeline as those tracks.
The data for the 2014 i2b2 Challenge is provided by Partners HealthCare. All records have been fully de-identified and manually annotated for risk factors related to diabetes and heart disease risk factors.
Data for the challenge is released under a Rules of Conduct and Data Use Agreement. Obtaining the data requires completing a registration, which started March 14, 2014. Training data was released on May 1, 2014 and is now available to registered users.
Evaluation Dates, File Formats, and Evaluation Metrics
Evaluation of software submitted to Track 3: Usability evaluation will be performed with a relevant use case for the specific task and using formal usability testing software. Usability evaluators will be students and staff affiliated with the University Of Michigan Center for Managing Chronic Disease who have extensive experience working with patient charts and patients with complex chronic diseases. The evaluation criteria and procedures include:
Particpants in Track 3 are asked to email Dr. Anupama E. Gururaj (Anupama.E.Gururaj@uth.tmc.edu) the following information:
* denotes a required field
Track 4 will be evaluated on the basis of papers to be submitted to the organizing committee. Due to the open-ended nature of this track, there will be no system comparison or ranking. The aim is to demonstrate creative, clinically-relevant and varied uses of the Track 1 and Track 2 data. Therefore, papers describing novel and well-executed systems will be considered for publication in a journal special issue related to this shared task.
Participants in Track 4 are asked to submit a short paper describing their system and analyzing their performance. Papers should be in AMIA style and should not exceed five pages. Authors of top performing systems and of particularly novel approaches will be invited to present or demo their systems at the workshop. Submitted software can be presented at the final workshop in the form of a poster or live demo.
Organizing Committee:
Ozlem Uzuner, co-chair, SUNY at Albany
Amber Stubbs, co-chair, SUNY at Albany
Hua Xu, co-chair, University of Texas, Houston
John Aberdeen, MITRE
Susanne Churchill, Partners Healthcare
Cheryl Clark, MITRE
Dina Demner Fushman, NIH/NLM
Joshua Denny, Vanderbilt University
Bill Hersh, Oregon Health and Science University
Lynette Hirschman, MITRE
Issac Kohane, Partners Healthcare
Vishesh Kumar, Massachusetts General Hospital
Anna Rumshisky, UMass Lowell
Stanley Shaw, Massachusetts General Hospital
Peter Szolovits, MIT
Meliha Yetisgen, University of Washington
Kai Zheng, University of Michigan
Please see the announcements for more information. Questions on the challenge can be addressed to Ozlem Uzuner, n2c2.challenges@gmail.com.