top | item 992754

XML-RPC client module for Node.js (uses libxml.js)

27 points| caludio | 16 years ago |github.com | reply

7 comments

order
[+] Maciek416|16 years ago|reply
I notice this client is based on libxml.js.

Does anyone know if anyone is working on a "flexible" HTML/XHTML parser a la BeautifulSoup / Nokogiri / TagSoup, etc? Node could become very useful as a base for building scrapers if this existed.

[+] sprsquish|16 years ago|reply
I've been trying to model libxml.js after Nokogiri. I wanted to get something built and working first. The next step is to expose libxml2's html parser.

Someone else has started working on find-by-CSS a la Nokogiri. I'll merge that into libxml.js when it's ready.

BTW, I'm looking for more help on this project. A new job has diminished the amount of time I can spend on OSS projects.

[+] olegp|16 years ago|reply
As an alternative to libxml2 one could use tagsoup as a separate process and a JS only parser like XML for script.
[+] simonw|16 years ago|reply
libxml2 has an excellent tag soup parser (the Python lxml module makes heavy use of it) - but I don't think libxml.js exposes it yet.
[+] Sephr|16 years ago|reply
Why use libxml when JavaScript already has a standard XML API, E4X (ECMAScript for XML), as specified by ECMA 357? At least libxml should use the faster native XML support behind the scenes if available.