Most of my work is published on GitHub: https://github.com/j535d165.
The Python Record Linkage Toolkit is a Python module for data linkage and deduplication using quasi-identifying variables. The toolkit provides an intuitive API with extensive documentation. It has more than 150k installations a month (January 2022).
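To give a feel for what linkage on quasi-identifying variables means, here is a minimal sketch using only the Python standard library. It is not the toolkit's API: the function names (`similarity`, `link_records`), the sample records, and the 0.85 threshold are all assumptions made for illustration. The idea is to compare only records that share a blocking key, then accept a pair when every compared field is similar enough.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Case-insensitive similarity between two strings, from 0.0 to 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def link_records(left, right, block_key, compare_keys, threshold=0.85):
    """Return (left_index, right_index) pairs that look like the same entity.

    Hypothetical helper for illustration, not part of any library.
    """
    # Blocking: index the right-hand records by the blocking key so that
    # only records sharing that value are ever compared.
    blocks = {}
    for j, record in enumerate(right):
        blocks.setdefault(record[block_key], []).append(j)

    matches = []
    for i, rec_a in enumerate(left):
        for j in blocks.get(rec_a[block_key], []):
            rec_b = right[j]
            # Comparison: every quasi-identifier must be similar enough.
            if all(similarity(rec_a[k], rec_b[k]) >= threshold for k in compare_keys):
                matches.append((i, j))
    return matches


# Made-up sample data with a spelling variation in one name.
people_a = [
    {"name": "Jonathan Smith", "city": "Utrecht", "birth_year": "1985"},
    {"name": "Maria Jansen", "city": "Leiden", "birth_year": "1990"},
]
people_b = [
    {"name": "Jonathon Smith", "city": "Utrecht", "birth_year": "1985"},
    {"name": "Peter de Vries", "city": "Leiden", "birth_year": "1990"},
    {"name": "Maria Jansen", "city": "Delft", "birth_year": "1972"},
]

matches = link_records(
    people_a, people_b, block_key="birth_year", compare_keys=["name", "city"]
)
print(matches)
```

Blocking on an exact birth year keeps the number of comparisons small, but it also means a record with a mistyped year is never considered, which is the usual trade-off of this approach.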
CBSOData is a Python tool for retrieving data from the open data interface of Statistics Netherlands (Centraal Bureau voor de Statistiek, CBS). The data is identical in content to the tables that can be retrieved and downloaded from StatLine. CBS datasets are accessed via the CBS open data portal. More information is available on the CBS website: https://www.cbs.nl/en-gb/corporate/2019/36/an-easy-quick-start-guide-to-cbs-open-data. CBSOData has around 3,500 installations a month (January 2022).