# XML Exporter (for PubMed and ISI)

DOS Batch program to export XML and SGML files to PubMed and ISI. Located in c:\\scielo\\xml\_scielo.

For PubMed, there are two types of files:

<div id="bkmrk-journal-data%3A-http%3A%2F"><div>- <dl class="first docutils" style="padding: 0px; margin: 0px !important 0px 0px 0px;"><dt>Journal data:</dt><dd style="margin-top: 3px; margin-bottom: 10px; margin-left: 30px; hyphens: auto; line-height: 1.4em;">[http://www.ncbi.nlm.nih.gov/books/bv.fcgi?rid=helplinkout.section.files.Resource\_File#files.Resource\_File\_Format](http://www.ncbi.nlm.nih.gov/books/bv.fcgi?rid=helplinkout.section.files.Resource_File#files.Resource_File_Format). Sent once or every time the journal data was changed.</dd></dl>
- <dl class="first docutils" style="padding: 0px; margin: 0px !important 0px 0px 0px;"><dt>Articles data:</dt><dd style="margin-top: 3px; margin-bottom: 10px; margin-left: 30px; hyphens: auto; line-height: 1.4em;">[http://www.ncbi.nlm.nih.gov/entrez/query/static/spec.html](http://www.ncbi.nlm.nih.gov/entrez/query/static/spec.html) Sent one XML file for each issue.</dd></dl>

<div></div></div></div>## Configuration

If it is the first installation, you have some procedures to execute.

There is a file in c:\\scielo\\xml\_scielo\\config.example. You have to copy and rename it to **config**.

Configure the files:

<div id="bkmrk-pubmed%5Cdoi_conf.txt-"><div><div>- PubMed\\doi\_conf.txt
- PubMed\\config\\config.seq
- PubMed\\journals\\journals.seq

<div></div></div></div></div>### File doi\_conf.txt

It contains the data of the Publisher and the prefix given by CrossRef, according to the agreement signed by CrossRef and the SciELO of each country. IF YOUR SCIELO DOES NOT HAVE IT. SO THIS FILE MUST BE EMPTY.

> INSTITUTION SPACE E-MAIL SPACE PREFIX
> 
> <div>![_images/xml_scielo_doiconf.jpg](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml_scielo_doiconf.jpg)</div>

### File config\\config.seq

The file configconfig.seq is to inform to the program which articles or text must not be sent to PubMed, because some kind of documents are not accepted, and it is know by the section in the table of contents.

> Acronym space sectionId
> 
> <div>![_images/xml_scielo_scilista.jpg](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml_scielo_scilista.jpg)</div>

### File journals\\journals.seq

This file journals\\journals.seq contains data used to generate XML file of the journal: journals\_acronimo.xml.

This is the first XML file which must be sent to PubMed in order to register the journal. Read more: [http://www.ncbi.nlm.nih.gov/books/bv.fcgi?rid=helplinkout.section.files.Resource\_File#files.Resource\_File\_Format](http://www.ncbi.nlm.nih.gov/books/bv.fcgi?rid=helplinkout.section.files.Resource_File#files.Resource_File_Format).

Its format is:

> <div>ACRONYM SPACE FIRST_YEAR_IN_PubMed SPACE SCIELO_URL SPACE ISSN</div>

One line for each journal.

> <div>![_images/xml_scielo_journal.jpg](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml_scielo_journal.jpg)</div>

## Executing

It has to be executed using the command line in DOS.

> <div>![_images/xml_scielo_doscommand.jpg](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml_scielo_doscommand.jpg)</div>

Go to the folder where this program is installed. E.g.: c:\\scielo\\xml\_scielo\\proc.

> <div>![_images/xml_scielo_doscommand2.jpg](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml_scielo_doscommand2.jpg)</div>

In proc you will find three scripts:

<div id="bkmrk-generatexml_all.bat%3A"><div><div>- GenerateXML\_all.bat: generates at the same time ISI and PubMed
- GenerateXML\_ISI.bat: generates SGML to ISI
- GenerateXML\_PubMed.bat: generates XML to PubMed
    
    > <div>![_images/xml_scielo_doscommand3.jpg](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml_scielo_doscommand3.jpg)</div>

</div></div></div>Provide a list similar to scilist, to execute any of them.

The program will open the scilist file and you have to check it, and include or remove lines, according to what you want to generate.

> <div>![_images/xml_scielo_doscommand4.jpg](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml_scielo_doscommand4.jpg)</div>

To generate also the XML file which contains journal data, journals\_&lt;acronimo&gt;.xml, add one more parameter “YES”:

<div id="bkmrk-hcsm-v13n2-yes"><div><div><div><div>```
hcsm v13n2 YES
```

</div></div></div></div></div>To generate ONLY the XML file which contains journal data, journals\_&lt;acronimo&gt;.xml, the second parameter must be “NONE” and third one must be YES.

<div id="bkmrk-hcsm-none-yes"><div><div><div><div>```
hcsm NONE YES
```

</div></div></div></div></div>To generate XML file of ahead articles, use as:

<div id="bkmrk-fourth-parameter%3A-th"><div><div>- fourth parameter: the start date
- fifth parameter: the end date

<div><div>```
hcsm 2014nahead YES 20140100 20140228
```

</div></div></div></div></div>The program will generate the XML file for articles which has ahpdate (publication date of ahead) between 20140100 and 20140228.

The name of the XML file will be hcsm2014nahead20140100-20140228.xml.

# SciELO XML to PubMed XML

Program to export XML to PubMed, according to [http://www.ncbi.nlm.nih.gov/books/NBK3828/](http://www.ncbi.nlm.nih.gov/books/NBK3828/), using SciELO XML (SciELO Publishing Schema).

## How to execute

Double clicking on c:\\scielo\\bin\\xml\\xml\_pubmed.py

> <div>![_images/xml2pubmed_window.png](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml2pubmed_window.png)</div>

Select the issue folder

> <div>![_images/xml2pubmed_chose_folder.png](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml2pubmed_chose_folder.png)</div>

Only if issue is published on batches, such aop or rolling pass, you should inform **from date** to generate XML for the article published from this date to the current date.

Then click on OK button.

According to the example, the program will create the file: v:\\scielo\\serial\\rsp\\v48n5\\PubMed\\rsp-v48n5-20160510-20160523.xml, containing articles which have epub date between 20160510 and the current date.

> <div>![_images/xml2pubmed_from_date.png](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml2pubmed_from_date.png)</div>

If it is not an issue published on batches, click on OK button. According to the example, the program will create the file: v:\\scielo\\serial\\rsp\\v48n5\\PubMed\\rsp-v48n5.xml.

> <div>![_images/xml2pubmed_chosen_folder.png](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml2pubmed_chosen_folder.png)</div>

Or execute it on a terminal:

> <div>![_images/xml2pubmed_terminal.png](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml2pubmed_terminal.png)</div>

Optionally informing the **from date**

> <div>![_images/xml2pubmed_terminal_from_date.png](https://scielo.readthedocs.io/projects/scielo-pc-programs/en/latest/_images/xml2pubmed_terminal_from_date.png)</div>