Hi, I am using Prof. Temple Lang's suggestions and I think I am close, but with the code below I get an error message that I don't fully understand. Any suggestions? Thanks!
Math

library(RCurl)
library(XML)

setwd("C:/Comments")
url <- getURLContent("http://www.scopus.com/results/results.url?sort=plf-f&src=s&sid=M8RcnaPRBgrtA1r_EvZtL7j%3a70&sot=a&sdt=a&sl=32&s=PMID%2811693556%29+OR+PMID%2812239288%29&origin=searchadvanced&txGid=M8RcnaPRBgrtA1r_EvZtL7j%3a7",
    options(RCurlOptions = list(proxy = "127.0.0.1:2048", proxyuserpwd = "username:password", proxyauth = "gci")),
    cookiefile = "/Rcookies")

Error in curlOptions(..., .opts = .opts) :
  unnamed curl option(s): list(RCurlOptions = list(proxy = "127.0.0.1:2048", proxyuserpwd = "username:password", proxyauth = "gci"))


Duncan Temple Lang wrote
>
> Apologies for following up on my own mail, but I forgot
> to explicitly mention that you will need to specify the
> appropriate proxy information in the call to getURLContent().
>
>   D.
>
> On 6/7/12 8:31 AM, Duncan Temple Lang wrote:
>> To just enable cookies and their management, use the cookiefile
>> option, e.g.
>>
>>    txt = getURLContent(url, cookiefile = "")
>>
>> Then you can pass this to readHTMLTable(), best done as
>>
>>    content = readHTMLTable(htmlParse(txt, asText = TRUE))
>>
>>
>> The function readHTMLTable() doesn't use RCurl and doesn't
>> handle cookies.
>>
>>   D.
>>
>> On 6/7/12 7:33 AM, mdvaan wrote:
>>> Hi,
>>>
>>> I am trying to access a website and read its content. The website is a
>>> restricted access website that I access through a proxy server (which
>>> therefore requires me to enable cookies). I have problems in allowing
>>> Rcurl to receive and send cookies.
>>>
>>> The following lines give me:
>>>
>>> library(RCurl)
>>> library(XML)
>>>
>>> url <- "http://www.theurl.com"
>>> content <- readHTMLTable(url)
>>>
>>> content
>>> $`NULL`
>>>   V1
>>> 1
>>> 2 Cookies disabled
>>> 3
>>> 4 Your browser currently does not accept cookies.\rCookies need to be enabled for Scopus to function properly.\rPlease enable session cookies in your browser and try again.
>>>
>>> $`NULL`
>>>   V1 V2 V3
>>> 1
>>>
>>> $`NULL`
>>>   V1
>>> 1 Cookies disabled
>>>
>>> $`NULL`
>>>   V1
>>> 1
>>> 2
>>> 3
>>>
>>> I have carefully read section 4.4. from this:
>>> http://www.omegahat.org/RCurl/RCurlJSS.pdf and tried the following
>>> without succes:
>>>
>>> curl <- getCurlHandle()
>>> curlSetOpt(cookiejar = 'cookies.txt', curl = curl)
>>>
>>> Any suggestions on how to allow for cookies?
>>>
>>> Thanks.
>>>
>>> Math
>>>
>>> --
>>> View this message in context:
>>> http://r.789695.n4.nabble.com/How-to-set-cookies-in-RCurl-tp4632693.html
>>> Sent from the R help mailing list archive at Nabble.com.
>>>
>>> ______________________________________________
>>> R-help@ mailing list
>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>> PLEASE do read the posting guide
>>> http://www.R-project.org/posting-guide.html
>>> and provide commented, minimal, self-contained, reproducible code.

--
View this message in context:
http://r.789695.n4.nabble.com/How-to-set-cookies-in-RCurl-tp4632693p4634147.html
Sent from the R help mailing list archive at Nabble.com.
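
A possible reading of the error above, offered as a sketch rather than a confirmed fix: options(RCurlOptions = ...) is an R-level call, and in the code above its result is handed to getURLContent() as an unnamed argument, which curlOptions() rejects. Passing the proxy settings as named curl options, in line with the quoted advice to give the proxy information in the call to getURLContent(), should avoid that message. The proxy address and credentials below are the placeholders from the post, fetch_via_proxy is a hypothetical helper name, and proxyauth is left out because the value it needs depends on the proxy.

```r
library(RCurl)

# Named curl options. The values are the placeholders from the post and
# must be replaced with the real proxy details.
proxy_opts <- curlOptions(
  proxy        = "127.0.0.1:2048",
  proxyuserpwd = "username:password",
  cookiefile   = ""   # empty string, as in the quoted reply, to enable cookie handling
)

# Hypothetical helper: fetch a page with the options above.
# Every option reaches curl by name through .opts, so nothing is unnamed.
fetch_via_proxy <- function(url, opts = proxy_opts) {
  getURLContent(url, .opts = opts)
}
```

With real proxy details filled in, txt <- fetch_via_proxy(url) should return the page text, which can then go to readHTMLTable(htmlParse(txt, asText = TRUE)) as in the quoted reply.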