Hi Armando, I'm working on a Clojurej library for sentiment analysis which doesn't contain everything you'd want for nlp but quite a nice subset of input modules (plain text corpora, rss feeds, html, etc...), tokenising/normalising filters (noise removal, porter stemmer, etc), distance/similarity metrics (euclidean, cosine, peasrsons, Jaccard/Tanimoto), b-o-w vector representation, clustering (hierarchical, k-means), classification (NN, Bayes, knn), and some other little tidbits to tie up loose ends. There will be a first release in about 2-3 weeks time. If you're planning on doing work in that direction, maybe we could join forces :) Kind Regards Andreas On 20/05/2011, at 5:12 AM, Armando Blancas wrote:
> Just in case I'll mention that Meikel's use of (with-open) will > automatically close the reader. > > On May 19, 11:40 am, dokondr <doko...@gmail.com> wrote: >> On May 19, 6:52 pm, Meikel Brandmeyer <m...@kotka.de> wrote: >> >>> Hi, >> >>> something like the following should work. >> >>> (with-open [rdr (java.io.FileReader. "file.txt")] >>> (doseq [line (line-seq rdr) >>> word (.split line "\\s")] >>> (when (.endsWith word "ing") >>> (println word)))) >> >>> Sincerely >>> Meikel >> >> Thanks everybody! The short one from Meikel (above) looks nice to >> me :) >> >> And the one from ClojureDocs too: >> >> (defn read-lines >> "Like clojure.core/line-seq but opens f with reader. Automatically >> closes the reader AFTER YOU CONSUME THE ENTIRE SEQUENCE." >> [f] >> (let [read-line (fn this [^BufferedReader rdr] >> (lazy-seq >> (if-let [line (.readLine rdr)] >> (cons line (this rdr)) >> (.close rdr))))] >> (read-line (reader f)))) > > -- > You received this message because you are subscribed to the Google > Groups "Clojure" group. > To post to this group, send email to clojure@googlegroups.com > Note that posts from new members are moderated - please be patient with your > first post. > To unsubscribe from this group, send email to > clojure+unsubscr...@googlegroups.com > For more options, visit this group at > http://groups.google.com/group/clojure?hl=en -- "Test-driven Dentistry (TDD!) - Not everything should be test driven" - Michael Fogus -- ********************************************************** Andreas Koestler, Software Engineer Leica Geosystems Pty Ltd 270 Gladstone Road, Dutton Park QLD 4102 Main: +61 7 3891 9772 Direct: +61 7 3117 8808 Fax: +61 7 3891 9336 Email: andreas.koest...@leica-geosystems.com ************www.leica-geosystems.com************* when it has to be right, Leica Geosystems Please consider the environment before printing this email. -- You received this message because you are subscribed to the Google Groups "Clojure" group. To post to this group, send email to clojure@googlegroups.com Note that posts from new members are moderated - please be patient with your first post. To unsubscribe from this group, send email to clojure+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/clojure?hl=en