Skip to content

SPWareing/AzureTimerTrigger

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Azure Timer Function to check for dataset update

This azure function uses the Pandas library with the Beautiful Soup extension in order to check if any dataset has been updated on the Historic England listing website and any heritage dataset from Historic Scotland.


How it Works

The NCRON expression for this repository is set to run daily at 7am.


"schedule": "0 0 7 * * *"

This is purely for testing, and it is recommended that a longer expression be used when uploaded to Azure, for example monthly:

"schedule": 0 0 7 1 1-12 *
  • When the timer is triggered it will send a request.get() call to the website. Upon a successful request code the request.text will be passed to a Pandas dataframe and processed.
  • A dataframe containing a reference to only the updated datasets will be output to Blob storage as a CSV.
  • The output CSV is uniquely identified by appending a DateTime to the filename: HE_event_{Datetime}.csv
  • For simplicity an Azure Logic App then checks the Blob storage for updates and then emails the results.

To do:

  • Update NCRON
  • Add Historic Scotland code
  • Add CADW code

About

Azure Timer Trigger Function to check website and output results to Blob Storage.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages