ASPN ActiveState Programmer Network
ActiveState
/ Home / Perl / PHP / Python / Tcl / XSLT /
/ Safari / My ASPN /
Cookbooks | Documentation | Mailing Lists | Modules | News Feeds | Products | User Groups


Recent Messages
List Archives
About the List
List Leaders
Subscription Options

View Subscriptions
Help

View by Topic
ActiveState
.NET Framework
Open Source
Perl
PHP
Python
Tcl
Web Services
XML & XSLT

View by Category
Database
General
SOAP
System Administration
Tools
User Interfaces
Web Programming
XML Programming


MyASPN >> Mail Archive >> pyxpcom
pyxpcom
Re: [pyxpcom] Python web crawler using Gecko
by Todd Whiteman other posts by this author
Mar 31 2008 8:23AM messages near this date
view in the new Beta List Site
Re: [pyxpcom] Python web crawler using Gecko | Re: [pyxpcom] Python web crawler using Gecko
Ivan VrtariÄ? wrote:
>  Now, I'm almost completely sure I'm doing a lot of things wrong, but I 
>  don't know where, how, or why. What I'd like to ask is, is this the 
>  right way of using the PyXPCOM at all, and is it even possible to use 
>  Gecko from Python in this manner (for example, to instance it, pass it 
>  some URL and retrieve the DOM tree from it)?
> 
>  Some code snippets would be appreciated, too :)
>    

Hi Ivan,

I'd suggest taking a look at the "hulahop" code that the OLPC (One 
Laptop Per Child) are using for running and controlling the Gecko engine 
with Python (it's a git repository). The "tree/webview.py" looks similar 
to what your trying to do here.

http://dev.laptop.org/git?p=projects/hulahop;a=tree

Note: there are also ready made Ubuntu packages for the hulahop module 
already, so you could use synaptic/apt-get to install it.

Cheers,
Todd

_______________________________________________
pyxpcom mailing list
pyxpcom@[...].com
To unsubscribe: http://listserv.ActiveState.com/mailman/mysubs
Thread:
ivrtaric
Mark Hammond
ivrtaric
Todd Whiteman
ivrtaric

Privacy Policy | Email Opt-out | Feedback | Syndication
© 2004 ActiveState, a division of Sophos All rights reserved