This is exactly what I do and it works great!
On Saturday, January 11, 2014 7:00:22 PM UTC-5, Jan Herich wrote:
>
> I don't recommend using java's built in HTTP retrieval (by passing
> java.net.URL object to enlive html-resource function).
> Not only is it significantly slower then using clj-http
I don't recommend using java's built in HTTP retrieval (by passing
java.net.URL object to enlive html-resource function).
Not only is it significantly slower then using clj-http (which uses
apache-http client under the hood), but it's also unreliable
when issuing more parallel requests.
Current
Java has HTTP retrieval built in. Clojure's core functions can use file or
http URLs:
user> (slurp "http://google.com";)
user> (slurp "file:///etc/passwd")
Parsing HTML on the other hand is a question of not just science but also
art. Doesn't enlive use Tag Soup?
--
--
You received this m
I was using net.cgrand.enlive-html/html-resource and org.httpkit.client/get
for the page retrievals.
On Saturday, January 11, 2014 6:24:48 PM UTC-5, Dave Tenny wrote:
>
> I'm just playing around with tool kits to retrieve and parse html from web
> pages and files that I already have on disk (suc
I'm just playing around with tool kits to retrieve and parse html from web
pages and files that I already have on disk (such as JDK API documentation).
Based on too little time, it looks like [http-kit "2.1.16"] will retrieve
but not parse html, and [enlive "1.1.5"] will retrieve AND parse html.