Here's a small sample of some test code that simply goes to
See here[
^]
and pulls in the specified web page.
<<pre lang="php">?php
$url = "http://www.un.org/apps/news/story.asp?NewsID=37180&Cr=Haiti&Cr1=";
$curl = curl_init();
curl_setopt($curl, CURLOPT_URL, $url);
curl_setopt($curl, CURLOPT_FAILONERROR, 1);
curl_setopt($curl, CURLOPT_COOKIESESSION, TRUE);
curl_setopt($curl, CURLOPT_FOLLOWLOCATION, 1);
curl_setopt($curl, CURLOPT_RETURNTRANSFER, 1);
curl_setopt($curl, CURLOPT_TIMEOUT, 20);
curl_setopt($curl, CURLOPT_USERAGENT, "Mozilla/5.0 (Windows NT 5.1; U; en; rv:1.8.0) Gecko/20060728 Firefox/1.5.0" );
$result = curl_exec($curl);
print $result;
<
When I look at the result, however, it's not quite what I'm wanting. If you look in the results for the string
"<span class=fullstory>United Nations humanitarian officials are calling for ?massive mobilization activities? in Haiti"
you can see the two question marks surrounding the text "massive mobilization activities".
If you go to the actual website, the question marks are rendered as a pair of left- and right- quotation marks, and this is reflected when you view the source code from the site ...
"<span class=fullstory>United Nations humanitarian officials are calling for “massive mobilization activities” in Haiti"
I'd like to know how I can grab the double quotes rather than the question marks that I'm seeing.
All suggestions gratefully accepted.
And happy new year to y'all