Podcast de facto Standard
I built a website and crawler to analyze the podcast ecosystem. The website contains various reports about the usage feed tags and audio properties.
For the information about tag usage, I used defusedxml and written some basic validators to check that the tags included are also valid according to there specification.
The audio analysis is done using ffmpeg and ffprobe. I only extract basic features provided by these tools.
I hope this information is helpful to the podcasting community and people building their own podcasting system.