Welcome to ScienceScraper!#

_images/pd-logo.png _images/pd-logo.png

ScienceScraper is an Python package that allows you to scrape scientific articles from various publishers through ScienceDirect and PubMedCentral. Designed to be used for training large machine learning models, ScienceScraper provides a simple API to retrieve clean and structured data from scientific articles directly from their DOIs, PIIs, PMIDs, or URLs.

Getting Started
_images/getting_started.svg

Learn how to install and use ScienceScraper to search and retrieve scientific publications.

Let’s get started!

User Guide
_images/user_guide.svg

Dive into the details of the ScienceScraper with a comprehensive tutorial.

Learn more!

API Reference
_images/api_reference.svg

Explore the ScienceScraper API reference to find out more about the available modules and functions.

Explore the API!

Contributing
_images/contributing.svg

Want to add something new? Contribute to the ScienceScraper project through GitHub.

Learn how to contribute!

Indices and tables#