Hi all, 

I have the following data for which I create a document term matrix first and 
then I add the time available to the dtm. In order to see the correlations to 
the term "updat" in the different years, I would like to have a heat-map for 
findassoc in a way that x-axis shows the time. 


  
> library(tm)  
library(ggplot2)
  > dput(df) 
structure(list(Description = structure(c(5L, 8L, 6L, 4L, 1L, 
2L, 7L, 9L, 10L, 3L), .Label = c("general topics done", "keep the general 
topics updated", 
"rejected topic ", "several topics in hand", "this is a genetal topic", 
"topic 333555 needs to be updated", "topic 5647 is handed over", 
"topic is updated", "update the topic ", "updating the topic is done " 
), class = "factor")), class = "data.frame", row.names = c(NA, 
-10L))
> corpus=Corpus(VectorSource(df$Description)) 
> corpus=tm_map(corpus,tolower)
> corpus=tm_map(corpus,removePunctuation)
corpus=tm_map(corpus,removeWords,c(stopwords("english")))
> corpus=tm_map(corpus,stemDocument,"english")
> frequenciescontrol=DocumentTermMatrix(corpus)
frequenciescontrol$time=c("2015","2015","2015","2015","2015","2016","2016","2016","2016","2016")
findAssocs(frequenciescontrol, "updat", 0.01)


Heatmap looking: y axis-> all the words correlated to "updat"      x axis: 
years               legend:correlation  

Thanks for any help.
Elahe!

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to