ruby-talk
HTML table to matrix with WWW::Mechanize
by Adam Hinchliffe
Nov 12 2006 11:15AM
Hi,

I am new to Ruby and am trying to scrape a table from a website into a 
matrix. I have been playing around with WWW::Mechanize and have had some 
success getting the page, extracting the table I want, and then separating 
the result into table rows. The problem comes when I try to split each row 
into its table data cells: I go through each result in the array to break 
it up by the <td> tag, but my code appears to have no effect!

The code I am using is below; any help in getting the table into a 
matrix would be really appreciated.

Thanks

Adam


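 require 'rubygems'
 require 'mechanize'


  agent = WWW::Mechanize.new
  agent.user_agent_alias = 'Mac Safari'
  # Fetch the page and keep only the racecard table.
  page = agent.get("http://horses.sportinglife.com/Racecards/0,12495,215137,00.html").search("//table[@class='racecard_table']")
  tablerows = page.search("//tr")
  puts tablerows.length
  # Build the matrix: one inner array of cell text per table row.
  finalresult = Array.new
  tablerows.each do |row|   # distinct name, so the block variable no longer shadows tablerows
     # "td" is searched relative to this row and reduced to plain text.
     finalresult << row.search("td").map { |cell| cell.inner_text.strip }
  end
  puts finalresult.length   # still the number of rows; the cells are inside each row
  p finalresult             # print the nested array to inspect the cells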
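
The row-then-cell idea described above can be tried without a network connection. The following is only a minimal sketch under stated assumptions: it uses REXML from the Ruby distribution in place of WWW::Mechanize, the markup in SAMPLE_HTML is made-up stand-in data rather than the real racecard page, and table_to_matrix is a hypothetical helper name. REXML needs well-formed markup, so it is not a substitute for an HTML parser on real pages.

```ruby
require 'rexml/document'

# Hypothetical, well-formed sample markup standing in for the racecard table.
SAMPLE_HTML = <<~HTML
  <table class="racecard_table">
    <tr><td>1</td><td>Horse A</td></tr>
    <tr><td>2</td><td>Horse B</td></tr>
  </table>
HTML

# Returns an array of rows, each row being an array of stripped cell strings.
def table_to_matrix(markup)
  doc = REXML::Document.new(markup)
  REXML::XPath.match(doc, "//tr").map do |row|
    # "td" is evaluated relative to this row, so only its own cells match.
    REXML::XPath.match(row, "td").map { |cell| cell.text.to_s.strip }
  end
end

matrix = table_to_matrix(SAMPLE_HTML)
puts matrix.length        # number of rows
puts matrix.first.length  # number of cells in the first row
p matrix                  # the nested array itself
```

The outer array's length is always the row count, so splitting rows into cells does not change it. The effect of the split only shows up in the length or contents of each inner array.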

-- 
Posted via http://www.ruby-forum.com/.