Hi, everyone! I'm a novice at R with tm package. So I need your help!!
I'd like to analyze some German texts using tm package for my papers. But somehow, I could not use it well. Now I'm using R ver.2.10.1 and tm package ver.0.5-2 on WindowsXP and also MacOS X 10.6 (snow leopard). Input commands that I tried are like these; > txt <- system.file("c:\\text\\", "ABHANDLUNGXII.txt", package = "tm") > txt [1] "" > It seems to not be read "ABHANDLUNGXII.txt" into "txt". I've also tried some other commands; > html <- system.file("c:\\text\\", "ABHANDLUNGXII.html", package = "tm") > html [1] "" It also seems nothing to happened. Then I've checked the document reader by function "getReaders()" in tm package; > getReaders() [1] "readDOC" "readGmane" "readPDF" [4] "readReut21578XML" "readReut21578XMLasPlain" "readPlain" [7] "readRCV1" "readRCV1asPlain" "readTabular" > There seems to be no "readHTML". But there is "readPDF" and "readPlain". I haven't touch anything around the package (I mean, "sytem level.") I only installed some other standard package using the package manager in R. Is any wrong syntax or operation? Could anyone give me suggestions and advice about this? Are there anything which I have to prepare around my documents? Pointer to useful documents and sites are also welcome. Could you please help me? Thanks in advance! Reagards, Lica Oka from Japan. (sorry if there is anything wrong expression in English.) [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.