Re: [R] what is the faster way to search for a pattern in a few million entries data frame ?

Fabien Tarrade Sun, 10 Apr 2016 15:29:10 -0700

Hi Duncan,

Didn't you post the same question yesterday? Perhaps nobody answered because your question is unanswerable.

Sorry, I got an email saying that my message was waiting for approval, and when I looked at the forum I didn't see my message, which is why I sent it again. This time I checked that the format of my message was text only. Sorry for the noise.

You need to describe what the strings are like and what the patterns are like if you want advice on speeding things up.

My strings are 1-grams up to 5-grams (sequences of 1 to 5 words), and I am searching my DF for the frequency of the strings that start with a given sequence of a few words.

I guess these days it is standard to use DFs with millions of entries, so I was wondering how people do that in the fastest way.
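
To make the lookup concrete, here is a minimal sketch in R of the kind of prefix search described above. The data frame `ngrams`, its columns `ngram` and `freq`, and the example strings are all assumptions made up for illustration, not the real data. The sketch uses base R's `startsWith()` (available from R 3.3.0), which does a fixed-string prefix test and avoids the regular-expression engine; whether it is fast enough for a few million rows would need to be measured on the actual data.

```r
# Hypothetical toy table standing in for the real n-gram data frame.
# Column names `ngram` and `freq` are assumptions for illustration.
ngrams <- data.frame(
  ngram = c("the cat", "the cat sat", "the cat sat on", "the cats", "the dog"),
  freq  = c(10L, 4L, 2L, 5L, 7L),
  stringsAsFactors = FALSE
)

prefix <- "the cat"

# Match the prefix itself, or any longer n-gram that continues it.
# The trailing space keeps "the cats" from matching "the cat".
is_hit <- ngrams$ngram == prefix |
  startsWith(ngrams$ngram, paste0(prefix, " "))

hits  <- ngrams[is_hit, ]
total <- sum(hits$freq)  # combined frequency of all matching n-grams
```

On older R versions, `substr(ngrams$ngram, 1, nchar(prefix) + 1) == paste0(prefix, " ")` is one possible stand-in for the `startsWith()` call.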


Thanks
Cheers
Fabien

--
Dr Fabien Tarrade

Quantitative Analyst/Developer - Data Scientist

Senior data analyst specialised in the modelling, processing and statistical treatment of data. PhD in Physics, 10 years of experience as researcher at the forefront of international scientific research.

Fascinated by finance and data modelling.

Geneva, Switzerland

Email : cont...@fabien-tarrade.eu
Web : www.fabien-tarrade.eu
Phone : +33 (0)6 14 78 70 90

LinkedIn <http://ch.linkedin.com/in/fabientarrade/> Twitter <https://twitter.com/fabtar> Google <https://plus.google.com/+FabienTarradeProfile/posts> Facebook <https://www.facebook.com/fabien.tarrade.eu> Skype <skype:fabtarhiggs?call> Xing <https://www.xing.com/profile/Fabien_Tarrade>

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
