How large? And across how many EMRs?
JG

--
Sent from Mailbox

On Mon, Sep 29, 2014 at 6:58 PM, Ajay Jain <ajayj...@mobileinsights.net> wrote:

> Sorry, I wasn't clear. I am working on a related project and trying to figure
> out if the code can be repurposed for a lab mention annotator for cTAKES.
> From what I have seen, test names from different institutions are not
> standardized which makes it hard to standardize the resulting annotation.
> Getting access to a larger lab tests dataset (structured) will help me fine
> tune the model.
>
> Hope this helps.
> Ajay
> Sent from my iPhone
>> On Sep 29, 2014, at 2:12 PM, "Savova, Guergana"
>> <guergana.sav...@childrens.harvard.edu> wrote:
>>
>> Ajay,
>> cTAKES currently does not implement a method to discover labs from the text.
>> The motivation is that you can get that easily from the structured part of
>> the EMR (what Pete explained below). Hope this makes sense!
>> --Guergana
>>
>> -----Original Message-----
>> From: Peter Szolovits [mailto:p...@mit.edu]
>> Sent: Monday, September 29, 2014 2:32 PM
>> To: dev@ctakes.apache.org
>> Subject: Re: De-identified lab tests dataset
>>
>> Ajay, I'm confused by your query. cTakes is good at interpreting text, but
>> most lab test results are reported in tabular form that is most
>> appropriately searched by SQL queries. Sometimes lab results are also
>> reported in narrative notes, but parsing those is often more a matter of
>> deciphering the text structure of tables than of parsing real English text.
>> What am I misunderstanding?
>>
>> --Pete Sz.
>>
>>> On Sep 29, 2014, at 2:25 PM, Ajay Jain <ajayj...@mobileinsights.net> wrote:
>>>
>>> Hello All,
>>>
>>> I am working on a use case for lab tests data using cTAKES and my
>>> online search to find a test dataset has been futile. I'll greatly
>>> appreciate if someone can share such a dataset or can point me in the
>>> right direction to go looking for one.
>>>
>>> Best,
>>> Ajay
>>>
>>> --
>>> Founder & CEO
>>> Mobile Insights, Inc.
>>> (630) 408-8623
>>