Hi,
Try this: one = data.frame(keyword=c("|auto", "NA|auto|insurance|quote", "NA|auto|insurance", "NA|insurance", "NA|auto|insurance", "<NA>")) onenew<-data.frame(keyword=gsub("(NA){0,1}\\|","",one$keyword)) onenew1<-data.frame(keyword=gsub("(<NA>){0,1}","",onenew$keyword)) onenew1 keyword 1 auto 2 autoinsurancequote 3 autoinsurance 4 insurance 5 autoinsurance 6 A.K. ----- Original Message ----- From: Abraham Mathew <abmathe...@gmail.com> To: r-help@r-project.org Cc: Sent: Thursday, July 19, 2012 3:21 PM Subject: [R] Removing values from a string So I have the following data frame and I want to know how I can remove all "NA" values from each string, and also remove all "|" values from the START of the string. So they should something like "auto|insurance" or "auto|insurance|quote" one = data.frame(keyword=c("|auto", "NA|auto|insurance|quote", "NA|auto|insurance", "NA|insurance", "NA|auto|insurance", "<NA>")) one Can anyone point me in the right direction? I'm still not too familiar with regex or gsub to find a solution, and there doesn't seem to be anything helpful in the stringr package for this task. Thanks -- *Abraham Mathew Statistical Analyst www.amathew.com 720-648-0108 @abmathewks* [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.