Hello every one I am Daniel Fernandez, from Spain. I am new to cTAKES and I am wondering about if it is possible to use cTAKES to parse and annotate text in Spanish. I have already found a thread in this list of a developer trying to achieve the same, called cTAKES Translation <http://mail-archives.apache.org/mod_mbox/ctakes-dev/201311.mbox/%3cf42b1eaf-622f-453d-aef0-44f002850...@upm.es%3E>. However, I am not sure about how to implement the solution, neither if it is possible due to the availability of training data. I have some general questions about this:
- Is there any public corpus adequate to train the ML-based models for Spanish? - Does The International Edition of UMLS which works with cTAKES by default include Spanish content? Or should it be provided in some other way? - The process just consist on including new models and new dictionaries? Or should we make further adaptations in the code? I would also like to know if this is actually possible, in Spanish or any other language different to English. Just to confirm that cTAKES may fit in my requirements and to have a reference of how to start. Thank you for your help. Sincerelly, Daniel Fernandez <https://www.avast.com/sig-email?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail> Libre de virus. www.avast.com <https://www.avast.com/sig-email?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail> <#DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2>