Does anyone know of a good parser for Wikipedia content? I don't want to write a full parser if I don't have to.
The parsing shouldn't be too hard in Perl (or any language with regex support), but I suspect a tool for this already exists, since it sounds like something a lot of people would need to do.
First I would export a page from Wikipedia using Wikipedia export (taking an arbitrary animator's article, "Craig_Clark", as an example).
Then, from the exported XML page, I would only use the text part:
<text xml:space="preserve" bytes="3618"> text text 3618 bytes of text </text>
But what I need to do is convert the wiki text (e.g. [[animator]]) to regular text (e.g. animator).
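
To make the goal concrete, here is a rough sketch of the conversion I have in mind, written in Python only because any regex-capable language would do. The function names (extract_page_text, strip_wiki_links) and the sample XML are made up for illustration, and the regex only handles simple [[target]] and [[target|label]] links; templates, file links, nested brackets and other wiki markup are left untouched, which is exactly why I am hoping a real parser exists.

```python
import re
import xml.etree.ElementTree as ET

# Matches [[target]] and [[target|label]]. Links with more than one pipe
# (e.g. file links) or nested brackets do not match and are left as-is.
WIKI_LINK = re.compile(r"\[\[(?:[^\[\]|]*\|)?([^\[\]|]*)\]\]")


def strip_wiki_links(wikitext):
    """Replace [[target]] with target and [[target|label]] with label."""
    return WIKI_LINK.sub(r"\1", wikitext)


def extract_page_text(export_xml):
    """Return the contents of the first <text> element in an export document."""
    root = ET.fromstring(export_xml)
    for elem in root.iter():
        # With an XML namespace the tag looks like "{namespace}text",
        # so compare only the part after the closing brace.
        if elem.tag.rsplit("}", 1)[-1] == "text":
            return elem.text or ""
    return ""


# Hypothetical, heavily shortened stand-in for an exported page.
sample = (
    "<mediawiki><page><revision>"
    '<text xml:space="preserve">Craig Clark is an [[animator]] '
    "known for [[Animation|animated]] shorts.</text>"
    "</revision></page></mediawiki>"
)

plain = strip_wiki_links(extract_page_text(sample))
print(plain)  # Craig Clark is an animator known for animated shorts.
```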
Anyway, thanks for your comments.