File with multiple Doctype declaration using CSPLIT

55 views Asked by At

I have a file with multiple Doctype declarations. I am trying to use CSPLIT to break the file up into smaller chunks but running into some issues. Here is a sample of the file I am working with:

<?xml version="1.0" ?>
<!DOCTYPE pmc-articleset PUBLIC "-//NLM//DTD ARTICLE SET 2.0//EN" "https://dtd.nlm.nih.gov/ncbi/pmc/articleset/nlm-articleset-2.0.dtd">

<pmc-articleset><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
  <?properties open_access?>
  <front>
    <p>
    Apple
    </p>
  </front>
</article>
</pmc-articleset>
<?xml version="1.0" ?>
<!DOCTYPE pmc-articleset PUBLIC "-//NLM//DTD ARTICLE SET 2.0//EN" "https://dtd.nlm.nih.gov/ncbi/pmc/articleset/nlm-articleset-2.0.dtd">
<pmc-articleset><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
  <?properties open_access?>
  <front>
    <p>
    Banana
    </p>
  </front>
</article>
</pmc-articleset>

Here is my command:

csplit -z --prefix output_file --suffix-format '%02d.xml' handSurgery.xml '/^<[?]xml[ ]/' '{*}'

Here are the errors:

csplit: illegal option -- z

Any solution would be appreciated. Thank you!

0

There are 0 answers