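The docx format is actually pretty simple: it is a zip archive. If you upload it to the server and unzip it, you'll see a set of XML files, with the main body text in word/document.xml. You can poke around and figure out the format, or you can find the spec online (the format is Office Open XML, published as ECMA-376). Note that this only applies to .docx; the older .doc format is binary, not zipped XML.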
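For what it's worth, here is a rough sketch of that idea in Ruby. It is only an illustration under a few assumptions: the helper names (docx_xml_to_text, read_docx_text) are made up for this example, it assumes the command-line unzip tool is installed and on the server's path, and it pulls text out with regular expressions rather than a real XML parser, so treat it as a starting point rather than a finished solution.

```ruby
require 'open3'

# The five predefined XML entities. Numeric character references
# (e.g. &#233;) are NOT handled in this sketch.
XML_ENTITIES = {
  '&lt;' => '<', '&gt;' => '>', '&quot;' => '"', '&apos;' => "'", '&amp;' => '&'
}.freeze

# Hypothetical helper: extract the visible text from the XML stored at
# word/document.xml. Text runs live in <w:t> elements and paragraphs end
# with </w:p>, so we split on paragraph ends and join the runs in each.
# Paragraphs with no text are dropped.
def docx_xml_to_text(xml)
  paragraphs = xml.split('</w:p>').map do |chunk|
    text = chunk.scan(%r{<w:t(?:\s[^>]*)?>(.*?)</w:t>}m).flatten.join
    text.gsub(/&(?:lt|gt|quot|apos|amp);/, XML_ENTITIES)
  end
  paragraphs.reject(&:empty?).join("\n")
end

# Hypothetical helper: stream word/document.xml out of the archive with
# "unzip -p" (assumes the unzip utility is installed) and extract its text.
def read_docx_text(path)
  xml, status = Open3.capture2('unzip', '-p', path, 'word/document.xml')
  raise "unzip failed for #{path}" unless status.success?

  docx_xml_to_text(xml.force_encoding('UTF-8'))
end
```

Something like read_docx_text('/path/to/upload.docx') would then give you the text one paragraph per line. Passing the arguments to Open3 separately, instead of building a back-tick string, keeps an uploaded file name from being interpreted by the shell.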
On Thu, Sep 13, 2012 at 9:59 AM, Walter Lee Davis <wa...@wdstudio.com> wrote:
>
> On Sep 13, 2012, at 7:35 AM, rovin varshney wrote:
>
>> Hello Everyone,
>> I'm looking for a way to parse doc/docx files in Ruby on Rails.
>> I have used File.open('filename','r'), but it shows special characters
>> instead of the content of the file.
>
> If all you want is the text content of the files, you can try the ancient
> Unix utility catdoc to do that. Just back-tick to that command (and make sure
> it's installed in your Web server's path). The result will not be pretty, but
> it will have all of the words in it.
>
> Walter
>
>>
>> Thanks.
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "Ruby on Rails: Talk" group.
>> To post to this group, send email to rubyonrails-talk@googlegroups.com.
>> To unsubscribe from this group, send email to
>> rubyonrails-talk+unsubscr...@googlegroups.com.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msg/rubyonrails-talk/-/O5fkWF3a1ecJ.
>> For more options, visit https://groups.google.com/groups/opt_out.