What's the best XML parser for Perl?

22,250

Solution 1

I think you are using a pretty good one. XML::LibXML, Matt Sergeant and Christian Glahn's Perl interface to Daniel Velliard's libxml2 is one of the faster XML Parsers that I know of.

Solution 2

It really depends on your needs, as people have said. To parse XML files that were ~100Mb in size (gene annotations from TAIR, 1 file per chromosome), I used mirod's XML::Twig module, which lets you set callbacks to parse the elements that interest you, presenting each sub-document as an XML::Simple tree. It combines the benefits of a SAX parser (scanning the file as a stream) with a DOM parser (working more easily with the interesting pieces).

Solution 3

If you need speed, power or features, XML::LibXML is the way to go. If you're after ease of use, though, XML::Simple is a viable alternative.

Solution 4

In my experience XML::Simple is best for quick and dirty parsing of XML. We use it for parsing data from third parties that do not always conform to the XML standard. XML::Simple throws informative errors and gets you up an running extremely quickly.

Solution 5

(Actually it's not an answer, but a comment - however, I cannot comment...)

XML::Simple has been mentioned here.
(I know it's few from few years ago, but this appeared up in Google today...)

However, it's site (http://metacpan.org/pod/XML::Simple) now says:

STATUS OF THIS MODULE

The use of this module in new code is discouraged. Other modules are available which provide more straightforward and consistent interfaces. In particular, XML::LibXML is highly recommended.

The major problems with this module are the large number of options and the arbitrary ways in which these options interact - often with unexpected results.

Patches with bug fixes and documentation fixes are welcome, but new features are unlikely to be added.

Share:
22,250
Reinout van Rees
Author by

Reinout van Rees

Geek, pure and simple.

Updated on July 09, 2022

Comments

  • Reinout van Rees
    Reinout van Rees almost 2 years

    I have tried many of the Perl XML Parsers. I was quite interested in the Sablotron Parser, but it is such a pain to install on a Windows box. Currently I have started using XML::LibXML and XML::LibXSLT both of which seem to do everything I need.

    They seem to be quite standard as well. Are there any better XML Parsers to use than this?