es-extractor

Provides an REST API to the mercury webparser. Able to retrieve content from web pages and return in them in JSON.

Endpoints

Both endpoints return a JSON object with the following fiels.

{
  "title": "Thunder (mascot)",
  "raw_content": "... Thunder is the stage name for the...",
  "markdown_content": "![ThunderII.jpg](https://upload.wikimedia.org/wikipedia/commons/thumb/9/93/ThunderII.jpg/220px-ThunderII.jpg)\n\nThunder\n\n[](https://en.wikipedia.org/wiki/File:ThunderII.jpg)\n\nThunder II and Ann Judge\n\nBreed[Arabian horse](https://en.wikipedia.org/wiki/Arabian_horse)Discipline...",
  "author": "Wikipedia Contributors",
  "date_published": "2016-09-16T20:56:00.000Z",
  "lead_image_url": null,
  "dek": null,
  "next_page_url": null,
  "url": "https://en.wikipedia.org/wiki/Thunder_(mascot)",
  "domain": "en.wikipedia.org",
  "excerpt": "Thunder Thunder is the stage name for the horse who is the official live animal mascot for the Denver Broncos",
  "word_count": 4677,
  "direction": "ltr",
  "total_pages": 1,
  "rendered_pages": 1
}

`/mercury/url`

Takes a request JSON object with a single url field. Downloades the page, retrieves raw and markdown content and returns.

`/mercury/html`

Takes a request JSON object with an url and a html field. The html field must contain the properly escaped html content of the url web page. This is done to prevent loading a page twice. Retrieves raw and markdown content and returns.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
Readme.md		Readme.md
logger.js		logger.js
main.js		main.js
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

es-extractor

Endpoints

`/mercury/url`

`/mercury/html`

About

Uh oh!

Releases

Packages

Languages

MMMoA/es-extractor

Folders and files

Latest commit

History

Repository files navigation

es-extractor

Endpoints

/mercury/url

/mercury/html

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`/mercury/url`

`/mercury/html`

Packages