Re: [VOTE] accept UIMA as a podling

Eric Nyberg Wed, 20 Sep 2006 17:41:42 -0700

Greetings,

I understand from the message traffic that there are some concerns about the current state of the UIMA proposal, but I'd like to offer my support (and my hope that the issues with the proposal are resolved).

Carnegie Mellon has been building and deploying text analysis programs using pluggable components for the last 3-4 years. Large-scale text analysis (e.g., for text data mining, populating a knowledge base, etc.) requires significant programming at many different levels of text representation: segmentation into sentences and tokens; recognition of basic entities such as organization names and person names; analysis of grammatical structure (parse trees); assignment of domain-specific meaning to parse trees; etc.

Until UIMA came along, there was no standard for how all these separate analysis steps could be integrated, and those of us trying to build end-to-end applications had to either write everything ourselves using a one-off proprietary design, or spend lots of time writing wrapper code to integrate existing components that didn't share the same underlying data model.

UIMA provides all the necessary ingredients to ease these issues. The data models used by individual components are represented by formal type systems; the components themselves implement (or are wrapped by implementations of) well-designed abstract interfaces; and tools are provided for creating aggregate analysis engines which integrate components in (possibly distributed) run-time configurations. The fact that IBM has made UIMA open source, and is searching for an appropriate open-source development venue, represents a significant opportunity. If things continue to move ahead, I expect that the students and staff working with me will be contributing cycles to the development effort.
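
To make the pattern concrete, here is a minimal Python sketch of components that share one data model and are combined into an aggregate. It is only an illustration of the idea, not the UIMA API: every name in it (Annotation, Document, Annotator, SentenceAnnotator, TokenAnnotator, AggregateAnnotator) is hypothetical, the "type system" is reduced to a type name on each annotation, and the sentence and token rules are deliberately naive.

```python
import re
from dataclasses import dataclass, field
from typing import List


@dataclass
class Annotation:
    """A typed span over the document text (hypothetical stand-in for a type system)."""
    type_name: str  # e.g. "Sentence" or "Token"
    begin: int
    end: int


@dataclass
class Document:
    """The shared data model that every component reads from and writes to."""
    text: str
    annotations: List[Annotation] = field(default_factory=list)

    def covered_text(self, annotation: Annotation) -> str:
        return self.text[annotation.begin:annotation.end]

    def select(self, type_name: str) -> List[Annotation]:
        return [a for a in self.annotations if a.type_name == type_name]


class Annotator:
    """Abstract interface implemented (or wrapped) by each component."""

    def process(self, doc: Document) -> None:
        raise NotImplementedError


class SentenceAnnotator(Annotator):
    """Naive sentence splitter: a sentence ends at '.', '!' or '?'."""

    def process(self, doc: Document) -> None:
        for m in re.finditer(r"\S[^.!?]*[.!?]?", doc.text):
            doc.annotations.append(Annotation("Sentence", m.start(), m.end()))


class TokenAnnotator(Annotator):
    """Naive tokenizer that relies on Sentence annotations made upstream."""

    def process(self, doc: Document) -> None:
        for sentence in doc.select("Sentence"):
            for m in re.finditer(r"\w+", doc.covered_text(sentence)):
                doc.annotations.append(
                    Annotation("Token", sentence.begin + m.start(), sentence.begin + m.end())
                )


class AggregateAnnotator(Annotator):
    """Runs its delegate components in order over the same document."""

    def __init__(self, components: List[Annotator]) -> None:
        self.components = components

    def process(self, doc: Document) -> None:
        for component in self.components:
            component.process(doc)
```

Because the tokenizer depends only on the shared Document and the "Sentence" type, a different sentence splitter could be swapped in without any wrapper code, which is the property described above. Running AggregateAnnotator([SentenceAnnotator(), TokenAnnotator()]).process(doc) on a Document leaves both Sentence and Token annotations in doc.annotations.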

In addition to using UIMA on various R&D projects at CMU-LTI, we're also using UIMA in our Software Engineering course to teach architectural design for text analysis (http://durazno.lti.cs.cmu.edu/wiki/moin.cgi/11-792). Our students recently created the UIMA Component Repository (uima.lti.cs.cmu.edu), which we are promoting as a venue for sharing of completed components, type systems, and end-to-end solutions.


Eric Nyberg
Associate Professor
Language Technologies Institute
School of Computer Science
Carnegie Mellon University

Ian Holsman wrote:

Hi,

There has been some discussion around the UIMA proposal.
We feel that all the issues forwarded have been addressed, and we
would now like to officially propose UIMA to the Incubator for
consideration.


The proposal can be found in the Incubator wiki here:
http://wiki.apache.org/incubator/UIMA

[ ] +1 Accept UIMA as an Incubator podling
[ ]  0 Don't care
[ ] -1 Reject this proposal for the following reason:


--
Ian Holsman
[EMAIL PROTECTED]
http://parent-chatter.com -- what do parents know?



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


