Richard Liu wrote:
I'm new to R.  I'm working with the text mining package tm.  I have several
plain text documents in a directory, and I would like to read all the files
with extension .txt in that directory into a vector, one text document per
vector element.  That is, v[1] would be the first document, v[2] the second,
etc.

I know how to read the documents into a tm Corpus, but that's not what I
want to do.  I would think that this kind of operation should be elementary
and the first step in any text mining.

Thanks,
Richard

Hi Richard,

Try something along these lines:

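# pattern keeps only .txt files; full.names = TRUE returns complete paths,
# so the read function can find the files from any working directory
file_list = list.files("/where/are/the/files", pattern = "\\.txt$", full.names = TRUE)
obj_list = lapply(file_list, FUN = yourfunction)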

yourfunction is probably either read.table or some read function from the tm package, so obj_list will be a list of either data frames or tm objects.
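
For the case in the original question (a plain character vector with one whole document per element), a minimal sketch using only base R might look like the following. The helper name read_txt_dir and the temporary demo directory are made up for illustration, and joining lines with "\n" is an assumption about how each document should be stored.

```r
# Hypothetical helper (not part of base R or tm): read every .txt file in
# `dir` into a character vector, one whole document per element.
read_txt_dir <- function(dir) {
  # Match only names ending in .txt and return full paths
  files <- list.files(dir, pattern = "\\.txt$", full.names = TRUE)
  # Read each file line by line, then collapse the lines into one string
  docs <- vapply(
    files,
    function(f) paste(readLines(f, warn = FALSE), collapse = "\n"),
    character(1)
  )
  # Label each element with its file name rather than the full path
  names(docs) <- basename(files)
  docs
}

# Self-contained demo: write two small files to a temporary directory
demo_dir <- file.path(tempdir(), "txt_demo")
dir.create(demo_dir, showWarnings = FALSE)
writeLines(c("first document", "second line"), file.path(demo_dir, "a.txt"))
writeLines("second document", file.path(demo_dir, "b.txt"))

v <- read_txt_dir(demo_dir)
length(v)  # number of documents read
v[1]       # first document, as a single string
```

vapply is used instead of lapply here so that the result is a character vector rather than a list; with lapply you would get the same content as list elements and could flatten it with unlist.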

cheers,
Paul

--
Drs. Paul Hiemstra
Department of Physical Geography
Faculty of Geosciences
University of Utrecht
Heidelberglaan 2
P.O. Box 80.115
3508 TC Utrecht
Phone:  +3130 274 3113 Mon-Tue
Phone:  +3130 253 5773 Wed-Fri
http://intamap.geo.uu.nl/~paul

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
