[ https://issues.apache.org/jira/browse/CTAKES-510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rehan Ahmed updated CTAKES-510:
-------------------------------
    Description:
*Disclaimer: This is the first time I am putting a requirement here*

Lesion/tumor sizes are generally noted as area or volume in clinical text. The current MeasurementAnnotation annotates only uni-dimensional lengths such as "1 cm". I propose an enhancement of this annotation to be able to detect multidimensional measurements of the form "a x b x c", i.e., numbers delimited by the character 'x'.

|modified: [ctakes-core/src/main/java/org/apache/ctakes/core/nlp/tokenizer/TokenizerPTB.java|https://github.com/ahmeshaf/ctakes/commit/43dc84e69d82530c835a34b1cfa115c329719123#diff-704787dbe87a4796dc48c047354280b3]
modified: ctakes-core/src/main/java/org/apache/ctakes/core/fsm/machine/MeasurementFSM.java|
|added: ctakes-core/src/main/java/org/apache/ctakes/core/nlp/tokenizer/MeasurementPTB.java|

  was:
*Disclaimer: This is the first time I am putting a requirement here*

Lesion/tumor sizes are generally noted as area or volume in clinical text. The current MeasurementAnnotation annotates only uni-dimensional lengths such as "1 cm". I propose an enhancement of this annotation to be able to detect multidimensional measurements of the form "a x b x c", i.e., numbers delimited with the character 'x'.

|modified: [ctakes-core/src/main/java/org/apache/ctakes/core/nlp/tokenizer/TokenizerPTB.java|https://github.com/ahmeshaf/ctakes/commit/43dc84e69d82530c835a34b1cfa115c329719123#diff-704787dbe87a4796dc48c047354280b3]
modified: ctakes-core/src/main/java/org/apache/ctakes/core/fsm/machine/MeasurementFSM.java|
|added: ctakes-core/src/main/java/org/apache/ctakes/core/nlp/tokenizer/MeasurementPTB.java|


> Ability to annotate multidimensional measurements (area, volume)
> ----------------------------------------------------------------
>
>                 Key: CTAKES-510
>                 URL: https://issues.apache.org/jira/browse/CTAKES-510
>             Project: cTAKES
>          Issue Type: New Feature
>          Components: ctakes-core
>    Affects Versions: 4.0.0
>            Reporter: Rehan Ahmed
>            Priority: Minor
>              Labels: newbie, patch
>             Fix For: future enhancement
>
>         Attachments: ctakes.zip
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> *Disclaimer: This is the first time I am putting a requirement here*
> Lesion/tumor sizes are generally noted as area or volume in clinical text.
> The current MeasurementAnnotation annotates only uni-dimensional lengths such
> as "1 cm". I propose an enhancement of this annotation to be able to detect
> multidimensional measurements of the form "a x b x c", i.e., numbers
> delimited by the character 'x'.
> |modified:
> [ctakes-core/src/main/java/org/apache/ctakes/core/nlp/tokenizer/TokenizerPTB.java|https://github.com/ahmeshaf/ctakes/commit/43dc84e69d82530c835a34b1cfa115c329719123#diff-704787dbe87a4796dc48c047354280b3]
> modified:
> ctakes-core/src/main/java/org/apache/ctakes/core/fsm/machine/MeasurementFSM.java|
> |added:
> ctakes-core/src/main/java/org/apache/ctakes/core/nlp/tokenizer/MeasurementPTB.java|



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
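
For illustration only, the pattern described in the issue (numbers delimited by the character 'x', such as "a x b x c") can be sketched with a plain regular expression. This is not the code from the linked commit and it does not use the cTAKES tokenizer or MeasurementFSM. The class name `MultiDimMeasurementSketch`, the method `findMeasurements`, and the short unit list are all assumptions made up for this example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical stand-alone sketch; not part of cTAKES.
public class MultiDimMeasurementSketch {

    // A decimal number such as "2", "2.5" or ".5".
    private static final String NUMBER = "(?:\\d+(?:\\.\\d+)?|\\.\\d+)";

    // Assumed, deliberately short unit list; not the units cTAKES recognizes.
    private static final String UNIT = "(?:mm|cm|m|in)";

    // Two or three numbers delimited by 'x' or 'X', each optionally followed by a unit.
    private static final Pattern MEASUREMENT = Pattern.compile(
            "(?<![\\w.])" + NUMBER + "(?:\\s*" + UNIT + ")?"
            + "(?:\\s*[xX]\\s*" + NUMBER + "(?:\\s*" + UNIT + ")?){1,2}(?!\\w)");

    /** Returns every substring of text that looks like "a x b" or "a x b x c". */
    public static List<String> findMeasurements(String text) {
        List<String> found = new ArrayList<>();
        Matcher m = MEASUREMENT.matcher(text);
        while (m.find()) {
            found.add(m.group());
        }
        return found;
    }

    public static void main(String[] args) {
        // prints [2.5 x 3 x 1.2 cm]
        System.out.println(findMeasurements("Mass measuring 2.5 x 3 x 1.2 cm in the left lobe."));
        // prints [] because a one-dimensional length has no 'x' delimiter
        System.out.println(findMeasurements("Nodule is 1 cm."));
    }
}
```

A regular expression keeps the sketch short, but a token-level state machine, which is what the modified MeasurementFSM.java suggests the actual change uses, would fit the existing annotator pipeline better and could reuse its number and unit handling.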