API call to download digital files to run corpus analysis or crawl

We would like to download the Alma digital files using an API to conduct corpus analysis of our content. At present the available URLs pointing to the PDFs are pointing to the landing pages of the PDFs. It is not possible to extract the PDF from these pages as they load everything with Javascript in a browser.

Furthermore, it is not possible to automate the use of the job available to download digital files to meet our requirements.

There is no API available to construct URLs to get the underlying PDF for digital versions based on the records' metadata.

11 votes

Anonymous shared this idea · Jan 15, 2024 · Admin →

An error occurred while saving the comment

Anonymous commented · January 24, 2024 3:32 PM

Or a way to convert the PDFs files to HTML so they can be crawled in an automated way-

Submitting...

Alma: Resource Management - Digital

Give feedback

Alma 1,996 ideas

campusM 153 ideas

Content 381 ideas

Esploro 163 ideas

Leganto 229 ideas

Pivot-RP 82 ideas

Primo 550 ideas

RapidILL 43 ideas

Rapido 60 ideas

RefWorks 69 ideas

Rialto 109 ideas

Rosetta 184 ideas

Summon 284 ideas

How can we improve Alma?

API call to download digital files to run corpus analysis or crawl

Feedback

Alma: Resource Management - Digital

Feedback and Knowledge Base

Searching…

Contact support

Give feedback

Knowledge Base

Ex Libris

API call to download digital files to run corpus analysis or crawl

We're glad you're here

We're glad you're here

We're glad you're here

We're glad you're here

Alma: Resource Management - Digital

Categories

Searching…

Contact support

Give feedback

Knowledge Base

Ex Libris