Hi All,
I am working on situation where i have to write code to parse XML files with following :
1) Size of file will be Huge in future (So the code should be smart enough to deal with it). It may maximum reach upto 8GB.
2) The XML is not proper as it has no fix root tag rather have multiple XML in a single XML. (So we may need to add root tag and delete repated <?xml version and <!DOCTYPE ? tags) or split into multiple xml before parsing.
3) There is no XSD provide by the client rather DTD is given.
Do anyone has anything to share for such problem statement. I am thinking of multiple approach to deal with this situation like using shell script, SAX or sTax etc.
Just came to this forum, to see if we have similar problem faced by someone on future and i can be lucky