Send a url that points to the page you want to strip the info from.  Or look into 
another service or site that may give you the quotes in a raw text format.  Also look 
into the http://www.php.net/manual/en/function.strip-tags.php function in PHP.

Michael

On Saturday, January 13, 2001, at 03:11 PM, James Duncan wrote:

> Hi again, 
>  
> Thanks for your help so far. I will explain what I'm trying to achieve. I 
> want to pull down a web page that contains share prices, extract those share 
> prices, and update a database with the new prices. What I want is a nice and 
> neat solution that runs like a service (i.e. I can stop and start it from a 
> web browser, change the update interval (how long it waits before it repeats 
> the process)), etc. 
>  
> I know a little PHP and Javascript (what I've taught/learnt over the last 
> few weeks). The process I have so far (not implemented at all yet): 
>  
> 1) PHP script that pulls down the relevant web page to my server 
> 2) Data extraction from HTML web page 
> 3) Updating of database with data from step 2 
> 4) Running step 1 again after a certain period of time 
>  
> Step 2 is the most complex by far. I was hoping to use PHP to access the 
> #text value via DOM but obviously this isn't possible because the DOM 
> doesn't exist server-side but only after the HTML has been rendered 
> client-side. Like you say, I could create a form and hidden fields in the 
> HTML file and use Javascript to read the #text node values into the hidden 
> fields. Then trigger a POST operation, where I can read in all the values 
> but I don't like the sound of this because then I would have to have a 
> browser interacting with my PHP scripts!?! I'm trying to create a 
> self-contained "service" that doesn't have any external dependants. 
>  
> Is there any other way of accomplishing this without involving a browser? 
> The only other way I can see is to use PHP to strip all HTML tags, leaving 
> just the text? I could then write PHP code to read in the remaining text, 
> etc. 
>  
> What is the best way to accomplish this? Is there a PHP command that strips 
> all HTML tags (and Javascript, etc) from an HTML file? Example code would be 
> great ;) 
>  
> Thanks 
>  
> James 
>  
>  
> -----Original Message----- 
> From: Michael Stearne [mailto:[EMAIL PROTECTED]] 
> Sent: 13 January 2001 18:40 
> To: James Duncan 
> Cc: [EMAIL PROTECTED]; [EMAIL PROTECTED] 
> Subject: RE: [PHP-WIN] DOM 
>  
> No, actually I think that using PHP to insert the FORM and then use 
> Javascript to get the #text node values would be easier.  Parsing Table 
> cells using RegExs is no fun. 
>  
> Michael 
>  
> On Saturday, January 13, 2001, at 01:06 PM, James Duncan wrote: 
>  
> > But surely if I'm using fopen to insert a hidden form and fields I might 
> as 
> > well use fopen to extract the data from the HTML page in the first place? 
> It 
> > just seems so much easier to capture the #text node values from the DOM, 
> > rather than using fopen to locate the same information!?! 
> > 
> > Another idea would be to use my Javascript to capture the text node values 
> > from the DOM and write it to a cookie file. The contents of the cookie 
> file 
> > could then by read by PHP to populate the database? 
> > 
> > Thanks 
> > 
> > James 
> > 
> > 
> > -----Original Message----- 
> > From: Michael Stearne [mailto:[EMAIL PROTECTED]] 
> > Sent: 13 January 2001 17:32 
> > To: James Duncan 
> > Cc: [EMAIL PROTECTED]; [EMAIL PROTECTED] 
> > Subject: RE: [PHP-WIN] DOM 
> > 
> > 
> > On Saturday, January 13, 2001, at 12:20 PM, James Duncan wrote: 
> > 
> > > I don't think this will work in my case because I don't control the 
> layout 
> > > of the HTML page and hence can't add the hidden fields. I'm downloading 
> > the 
> > > HTML pages from a website. It would require as much work to insert the 
> > > hidden fields as trying to strip the HTML tags in an attempt to read the 
> > > data directly from the HTML page itself. There must be a way to access 
> the 
> > > DOM directly from PHP? I notice in the manual there is a section 
> regarding 
> > > XML DOM but not the DOM itself. 
> > > 
> > > Are the DOM values only available on the client? If that's the case then 
> > PHP 
> > > can't be used to read them because it's limited to the server side? 
> > 
> > Well by the time you are talking about PHP is out of the picture.  PHP can 
> > be used to generate a DOM but once its generated PHP (or any server side 
> > language) is out of the picture, it then goes to the client-side stuff 
> like 
> > you said. You can use PHP's fopen() to grab the page and then add the form 
> > and hidden fields I was talking about.  By doing this, you are setting up 
> > the page to be handled correctly by the Javascript code you inserted 
> through 
> > PHP. 
> > 
> > Michael 
> > 
> > > 
> > > Thanks 
> > > 
> > > James 
> > > 
> > > -----Original Message----- 
> > > From: Michael Stearne [mailto:[EMAIL PROTECTED]] 
> > > Sent: 13 January 2001 17:06 
> > > To: James Duncan 
> > > Cc: [EMAIL PROTECTED] 
> > > Subject: Re: [PHP-WIN] DOM 
> > > 
> > > Could you do something like: 
> > > 
> > > 
> > 
> myForm.myField.value=tablejames.firstChild.childNodes[1].childNodes[4].first 
> > > Child.firstChild.node Value; 
> > > 
> > > Set up a form of hidden fields.  Extract the values from the DOM and 
> then 
> > > have the user hit a Submit button to get to the next page.  At that 
> point 
> > > the values that were collected and put into the hidden form fields will 
> be 
> > > submitted and you next page (the PHP page) could INSERT the values into 
> > the 
> > > database, 
> > > 
> > > Michael 
> > > 
> > > 
> > > On Friday, January 12, 2001, at 07:30 PM, James Duncan wrote: 
> > > 
> > > > Hi folks, 
> > > > 
> > > > I'm still new to HTML, Javascript and PHP but learning (fast 
> hopefully). 
> > > > I've just started accessing DOM elements. I have worked out how to 
> > update 
> > > > the contents of table cells directly using this method, etc. In 
> > Javascript 
> > > I 
> > > > would use code like: 
> > > > 
> > > >   alert("Value is: " + 
> > > > 
> > > 
> > 
> tablejames.firstChild.childNodes[1].childNodes[4].firstChild.firstChild.node 
> > > > Name); 
> > > >   alert("Value is: " + 
> > > > 
> > > 
> > 
> tablejames.firstChild.childNodes[1].childNodes[5].firstChild.firstChild.node 
> > > > Value); 
> > > > 
> > > > This Javascript shows the name and value of the child element. 
> > > > 
> > > > Now I want to use PHP to extract data (values) from HTML pages like I 
> do 
> > > > with the above Javascript. Is this possible? Obviously with the 
> > Javascript 
> > > > the HTML page has already been rendered in the browser (i.e. all tree 
> > > > elements have been created). This makes extracting data a simple case 
> of 
> > > > finding the "#text" elements and reading in the values. Can I do the 
> > same 
> > > > thing with PHP and an HTML file I've downloaded from the Internet? 
> > > Obviously 
> > > > this file is sitting on my server and hasn't been rendered in a 
> > browser... 
> > > > 
> > > > The whole point of this exercise is so that I can extract values from 
> an 
> > > > HTML table and populate them into a database. Maybe it's easier to 
> > process 
> > > > the HTML file line by line and strip the unwanted HTML tags? However, 
> > with 
> > > > this approach I've got to hardcode each webpage... 
> > > > 
> > > > If this is a silly question then sorry but you only learn if you ask 
> ;) 
> > > > 
> > > > Thanks 
> > > > 
> > > > James 
> > > > 
> > > > 
> > > > 
> > > > -- 
> > > > PHP Windows Mailing List (http://www.php.net/) 
> > > > To unsubscribe, e-mail: [EMAIL PROTECTED] 
> > > > For additional commands, e-mail: [EMAIL PROTECTED] 
> > > > To contact the list administrators, e-mail: 
> [EMAIL PROTECTED] 
> > > > 
> > > > 
> > > > 
> > > 
> > > 
> > > 
> > 
> > 
> > -- 
> > PHP Windows Mailing List (http://www.php.net/) 
> > To unsubscribe, e-mail: [EMAIL PROTECTED] 
> > For additional commands, e-mail: [EMAIL PROTECTED] 
> > To contact the list administrators, e-mail: [EMAIL PROTECTED] 
> > 
> > 
> > 
>  
>  
>  

--
PHP Windows Mailing List (http://www.php.net/)
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
To contact the list administrators, e-mail: [EMAIL PROTECTED]

Reply via email to