[ 
https://issues.apache.org/jira/browse/CTAKES-510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rehan Ahmed updated CTAKES-510:
-------------------------------
    Description: 
*Disclaimer: This is the first time I am putting a requirement here*

Lesions/tumors sizes are generally noted as area or volume in clinical text. 
The current MeasurementAnnotation annotates only uni-dimensional lengths such 
as "1 cm". I propose an enhancement of this annotation to be able to detect 
multidimensional measurements of the form "a x b x c", i.e., numbers delimited 
with the character 'x'.


|modified: 
[ctakes-core/src/main/java/org/apache/ctakes/core/nlp/tokenizer/TokenizerPTB.java|https://github.com/ahmeshaf/ctakes/commit/43dc84e69d82530c835a34b1cfa115c329719123#diff-704787dbe87a4796dc48c047354280b3]
modified: 
ctakes-core/src/main/java/org/apache/ctakes/core/fsm/machine/MeasurementFSM.java|
|added: 
ctakes-core/src/main/java/org/apache/ctakes/core/nlp/tokenizer/MeasurementPTB.java|

  was:
*Disclaimer: This is the first time I am putting a requirement here*

Lesions/tumors sizes are generally noted as area or volume in clinical text. 
The current MeasurementAnnotation annotates only uni-dimensional lengths such 
as "1 cm". I propose an enhancement of this annotation to be able to detect 
multidimensional measurements of the form "a x b x c", i.e., numbers delimited 
with the character 'x'.

modified: 
[ctakes-core/src/main/java/org/apache/ctakes/core/fsm/machine/MeasurementFSM.java
|https://github.com/ahmeshaf/ctakes/commit/43dc84e69d82530c835a34b1cfa115c329719123#diff-4ec6dfb97bc6e6a71176a8d7e6c3408b]modified:
 
[ctakes-core/src/main/java/org/apache/ctakes/core/nlp/tokenizer/TokenizerPTB.java|https://github.com/ahmeshaf/ctakes/commit/43dc84e69d82530c835a34b1cfa115c329719123#diff-704787dbe87a4796dc48c047354280b3]
added: 
[ctakes-core/src/main/java/org/apache/ctakes/core/nlp/tokenizer/MeasurementPTB.java
|https://github.com/ahmeshaf/ctakes/commit/43dc84e69d82530c835a34b1cfa115c329719123#diff-879b985c4d6042a04a4054df2949a179]


> Ability to annotate multidimensional measurements (area, volume)
> ----------------------------------------------------------------
>
>                 Key: CTAKES-510
>                 URL: https://issues.apache.org/jira/browse/CTAKES-510
>             Project: cTAKES
>          Issue Type: New Feature
>          Components: ctakes-core
>    Affects Versions: 4.0.0
>            Reporter: Rehan Ahmed
>            Priority: Minor
>              Labels: newbie, patch
>             Fix For: future enhancement
>
>         Attachments: ctakes.zip
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> *Disclaimer: This is the first time I am putting a requirement here*
> Lesions/tumors sizes are generally noted as area or volume in clinical text. 
> The current MeasurementAnnotation annotates only uni-dimensional lengths such 
> as "1 cm". I propose an enhancement of this annotation to be able to detect 
> multidimensional measurements of the form "a x b x c", i.e., numbers 
> delimited with the character 'x'.
> |modified: 
> [ctakes-core/src/main/java/org/apache/ctakes/core/nlp/tokenizer/TokenizerPTB.java|https://github.com/ahmeshaf/ctakes/commit/43dc84e69d82530c835a34b1cfa115c329719123#diff-704787dbe87a4796dc48c047354280b3]
> modified: 
> ctakes-core/src/main/java/org/apache/ctakes/core/fsm/machine/MeasurementFSM.java|
> |added: 
> ctakes-core/src/main/java/org/apache/ctakes/core/nlp/tokenizer/MeasurementPTB.java|



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to