[R] Grap Element from Web Page

Sparks, John James Tue, 13 Aug 2013 22:37:16 -0700

Dear R Helpers,

I would like to pull the CIK number from the web page


http://www.sec.gov/cgi-bin/browse-edgar?CIK=MSFT&Find=Search&owner=exclude&action=getcompany

If you put this web page into your browser you will see the CIK number in
red on the left side of the page near the top.

When I try the basic
require(scrapeR)
require(XML)
require(RCurl)
doc
<-htmlTreeParse("http://www.sec.gov/cgi-bin/browse-edgar?CIK=MSFT&Find=Search&owner=exclude&action=getcompany";)
str(doc)

I get a large number of items in the data frame that I don't know how to
interpret.  Both
tables <- readHTMLTable(doc)

and

list<-xmlToList(doc)

result in errors.

Any (positive) guidance would be much appreciated.

--John J. Sparks, Ph.D.

______________________________________________
[email protected] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Grap Element from Web Page

Reply via email to