Python - Flask Frontend to generate Elasticsearch Docs from Sitemaps
I wrote a small Python script that helps me generate Elasticsearch docs from web URLs. Here I am adding a web frontend to it using Flask.
To use this script, you have to adjust it to match your own Elasticsearch mapping and the page elements you want to extract.
Generate a Single Entry
Paste in a URL and generate a JSON object from it that you can use in Kibana to add the page to Elasticsearch.
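As a rough illustration of this step, here is a minimal sketch that turns a page's HTML into a JSON object. The field names (`title`, `description`, `url`) and the helper `build_doc` are assumptions for the example, not the mapping of the original script, so you would swap in your own fields and page elements. It only uses the standard library and works on an HTML string, so a Flask view could call it after fetching the pasted URL.

```python
import json
from html.parser import HTMLParser


class PageFields(HTMLParser):
    """Collect the <title> text and the meta description of a page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.description = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and attrs.get("name") == "description":
            self.description = attrs.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Only text inside <title> is collected.
        if self._in_title:
            self.title += data


def build_doc(url, html):
    """Return a dict shaped like a (hypothetical) Elasticsearch document."""
    parser = PageFields()
    parser.feed(html)
    return {
        "title": parser.title.strip(),
        "description": parser.description.strip(),
        "url": url,
    }


# Inline sample page so the sketch runs without network access.
sample_html = """<html><head><title> Example Page </title>
<meta name="description" content="A short summary."></head><body></body></html>"""

doc = build_doc("https://example.com/page", sample_html)
print(json.dumps(doc, indent=2))
```

The printed JSON can then be pasted as the request body in the Kibana Dev Tools console.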
Process Sitemaps to retrieve all URLs
Process a page's XML sitemap and extract all URLs.
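The extraction can be sketched with the standard library's XML parser. This assumes the sitemap follows the sitemaps.org 0.9 schema, where every page is a `<url>` element with a `<loc>` child; the function name `extract_urls` is made up for this example, and sitemap index files that point to further sitemaps are not handled.

```python
import xml.etree.ElementTree as ET

# Namespace used by standard sitemaps (sitemaps.org protocol 0.9).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def extract_urls(sitemap_xml):
    """Return the text of every <loc> inside a <url> entry."""
    root = ET.fromstring(sitemap_xml)
    return [
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    ]


# Inline sample sitemap so the sketch runs without network access.
sample_sitemap = (
    '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    "<url><loc>https://example.com/</loc></url>"
    "<url><loc> https://example.com/about </loc></url>"
    "</urlset>"
)

urls = extract_urls(sample_sitemap)
print("\n".join(urls))
```

Writing the result to a text file with one URL per line is one simple way to hand it over to the next step.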
Generate Entries for a List of URLs
Use the generated file from above and generate Elasticsearch entries for every page in your website.
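One possible way to handle the whole list is to emit a request body for the Elasticsearch Bulk API, which expects an action line followed by the document on the next line. The sketch below assumes the URL file holds one URL per line and uses a hypothetical index name `pages`. The `placeholder_doc` function is a stand-in that only derives a title from the URL path; in practice you would pass in the function that fetches and parses each page.

```python
import io
import json


def read_urls(lines):
    """Strip whitespace and skip blank lines from an iterable of URL lines."""
    return [line.strip() for line in lines if line.strip()]


def to_bulk_body(urls, make_doc, index_name):
    """Build newline-delimited JSON for the Bulk API: action line, then document."""
    lines = []
    for url in urls:
        lines.append(json.dumps({"index": {"_index": index_name}}))
        lines.append(json.dumps(make_doc(url)))
    # The Bulk API requires the body to end with a newline.
    return "\n".join(lines) + "\n"


def placeholder_doc(url):
    """Stand-in for the real page parser (assumption for this example)."""
    return {"url": url, "title": url.rstrip("/").rsplit("/", 1)[-1]}


# io.StringIO stands in for the URL file produced in the previous step.
url_file = io.StringIO(
    "https://example.com/docs/install\n\nhttps://example.com/docs/usage\n"
)

bulk_body = to_bulk_body(read_urls(url_file), placeholder_doc, "pages")
print(bulk_body, end="")
```

The output can be sent as the body of a `POST _bulk` request, for example from the Kibana Dev Tools console.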