Sorry, I wasn't clear. I am working on a related project and trying to figure out if the code can be repurposed for a lab mention annotator for cTAKES. From what I have seen, test names from different institutions are not standardized which makes it hard to standardize the resulting annotation. Getting access to a larger lab tests dataset (structured) will help me fine tune the model. Hope this helps.
Ajay Sent from my iPhone > On Sep 29, 2014, at 2:12 PM, "Savova, Guergana" > <guergana.sav...@childrens.harvard.edu> wrote: > > Ajay, > cTAKES currently does not implement a method to discover labs from the text. > The motivation is that you can get that easily from the structured part of > the EMR (what Pete explained below). Hope this makes sense! > --Guergana > > -----Original Message----- > From: Peter Szolovits [mailto:p...@mit.edu] > Sent: Monday, September 29, 2014 2:32 PM > To: dev@ctakes.apache.org > Subject: Re: De-identified lab tests dataset > > Ajay, I'm confused by your query. cTakes is good at interpreting text, but > most lab test results are reported in tabular form that is most appropriately > searched by SQL queries. Sometimes lab results are also reported in > narrative notes, but parsing those is often more a matter of deciphering the > text structure of tables than of parsing real English text. What am I > misunderstanding? > > --Pete Sz. > >> On Sep 29, 2014, at 2:25 PM, Ajay Jain <ajayj...@mobileinsights.net> wrote: >> >> Hello All, >> >> I am working on a use case for lab tests data using cTAKES and my >> online search to find a test dataset has been futile. I'll greatly >> appreciate if someone can share such a dataset or can point me in the >> right direction to go looking for one. >> >> Best, >> Ajay >> >> -- >> Founder & CEO >> Mobile Insights, Inc. >> (630) 408-8623 >