<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>XML on Saleem Ansari</title>
    <link>/tags/xml/</link>
    <description>Recent content in XML on Saleem Ansari</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>en</language>
    <copyright>(c) 2024 Saleem Ansari</copyright>
    <lastBuildDate>Mon, 03 Feb 2014 00:00:00 +0000</lastBuildDate>
    <atom:link href="/tags/xml/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>A simple Scala parser to parse 44GB Wikipedia XML Dump</title>
      <link>/2014/02/03/a-simple-scala-parser-to-parse-44gb-wikipedia-xml-dump/</link>
      <pubDate>Mon, 03 Feb 2014 00:00:00 +0000</pubDate>
      <guid>/2014/02/03/a-simple-scala-parser-to-parse-44gb-wikipedia-xml-dump/</guid>
      <description>I had to parse a Wikipedia XML Dump ( 44GB XML file uncompressed ). The XML dump is available here, and I have also created a smaller sample file to run this code: sample wiki.xml file.&#xA;Below is the XML event based parser using Scala&amp;rsquo;s XMLEventReader:&#xA;package xml import scala.io.Source import scala.xml.pull._ import scala.collection.mutable.ArrayBuffer import java.io.File import java.io.FileOutputStream import scala.xml.XML object wikipedia extends App { val xmlFile = args(0) val outputLocation = new File(args(1)) val xml = new XMLEventReader(Source.</description>
    </item>
    <item>
      <title>XML, RSS, ATOM and Java</title>
      <link>/2012/10/03/xml-rss-atom-and-java/</link>
      <pubDate>Wed, 03 Oct 2012 00:00:00 +0000</pubDate>
      <guid>/2012/10/03/xml-rss-atom-and-java/</guid>
      <description>I was searching for ways to generate xml feeds ( ATOM / RSS ) using Java. It appeared to be trivial task but its not. There are so many different libraries in Java which are capable of reading and writing XML in Java that it became a daunting task to evaulate them. After a bit of experimentation I have settled down with JDOM. JDOM is so simple to use.&#xA;On top of that, ROME tools make it even easier to read/write feeds using Java.</description>
    </item>
  </channel>
</rss>
