PodcastDB Database and CSV Export Schema
The PodcastDB database and its CSV exports use the schema below. Each entry describes one field.
- rss_url: This field contains the RSS feed URL for the podcast. It is a unique identifier for the podcast.
- itunes_id: This field contains the unique identifier that iTunes assigns to the podcast.
- owner_name: This field contains the name of the owner of the podcast.
- author_name: This field contains the name of the author of the podcast.
- categories: This field contains the category or categories that the podcast belongs to.
- artwork_url: This field contains the URL of the artwork associated with the podcast.
- language: This field contains the language that the podcast is in.
- title: This field contains the title of the podcast.
- website: This field contains the website associated with the podcast.
- description: This field contains the description of the podcast.
- rss_generator: This field contains the name of the software that was used to generate the RSS feed for the podcast.
- import_locked: This field indicates whether the podcast is locked against import or not.
- funding_url: This field contains the URL of the funding page for the podcast.
- is_explicit: This field indicates whether the podcast contains explicit content or not.
- show_type: This field contains the type of show that the podcast is.
- keywords: This field contains the keywords associated with the podcast.
- totalaudioseconds: This field contains the total number of seconds of audio content in the podcast.
- avgepisodeduration: This field contains the average duration of each episode in the podcast.
- episode_count: This field contains the total number of episodes in the podcast.
- avgtimebetweenepisodesseconds: This field contains the average time between episodes in seconds.
- dtnewestepisode_published: This field contains the date and time of the newest episode published in the podcast.
- dtoldestepisode_published: This field contains the date and time of the oldest episode published in the podcast.
- newestepisodetitle: This field contains the title of the newest episode published in the podcast.
- newestepisodeshow_notes: This field contains the show notes for the newest episode published in the podcast.
- shownotescontain_timestamps: This field indicates whether the show notes contain timestamps or not.
- newestepisodeaudio_url: This field contains the URL of the audio file for the newest episode published in the podcast.
- checked_itunes: This field indicates whether the podcast has been checked on iTunes or not.
- host: This field contains the name of the podcast host.
- retail_host: This field indicates whether the podcast's host is a retail host or not.
- ability_iab: This field indicates whether the podcast is IAB compliant or not.
- ability_stats: This field indicates whether the podcast has statistics available or not.
- ability_tracking: This field indicates whether the podcast has tracking available or not.
- abilitydynamicaudio: This field indicates whether the podcast has dynamic audio available or not.
- avgfilesize_bytes: This field contains the average file size in bytes of the audio files associated with the podcast.
- totalfilesize_bytes: This field contains the total file size in bytes of the audio files associated with the podcast.
- id: This field is a unique identifier automatically generated by the database.
- copyright: This field contains the copyright information associated with the podcast.
- lasthttpcode: The last HTTP response code received from the RSS feed.
- prefix_analyzed: Whether the show's RSS feed URL has been analyzed to determine the show's host and categories.
- avg_rating: The average rating of the show on iTunes.
- total_ratings: The total number of ratings of the show on iTunes.
- itunes_suggested: An array of iTunes category IDs that the show is suggested to be listed under.
- rating_distribution: An array of integers giving the number of ratings for each rating value.
- release_frequency: The frequency at which new episodes of the show are released (e.g. "weekly", "biweekly", "monthly", etc.).
- created_date: The date when the show was added to the PodcastDB database.
- itunesarturl: The URL of the show's artwork on iTunes.
- itunesbgcolor: The background color of the show's artwork on iTunes.
- itunestextcolor1: The first text color of the show's artwork on iTunes.
- itunestextcolor2: The second text color of the show's artwork on iTunes.
- itunestextcolor3: The third text color of the show's artwork on iTunes.
- itunestextcolor4: The fourth text color of the show's artwork on iTunes.
- trackers: An array of URLs of tracking services used by the show.
- genre_names: An array of names of the genres that the show is listed under on iTunes.
- ratings_share: The percentage of ratings of the show that have each rating value.
- newrssurl: The updated RSS feed URL of the show (if it has changed).
- reverselikecount: The number of shows that have liked this show.
- reverse_likes: An array of IDs of the shows that have liked this show.
- ratingcountrank: The rank of the show based on the number of ratings it has.
- md5: The MD5 hash of the show's RSS feed content.
- facebook: An array of URLs of the show's Facebook profiles.
- instagram: An array of URLs of the show's Instagram profiles.
- linkedin: An array of URLs of the show's LinkedIn profiles.
- youtube: An array of URLs of the show's YouTube channels.
- twitter: An array of URLs of the show's Twitter profiles.
- discord: An array of URLs of the show's Discord servers.
- twitch: An array of URLs of the show's Twitch channels.
- youtube_profile: The URL of the show's YouTube profile page.
- twitch_profile: The URL of the show's Twitch profile page.
- facebook_profile: The URL of the show's Facebook profile page.
- linkedin_profile: The URL of the show's LinkedIn profile page.
- twitter_profile: The URL of the show's Twitter profile page.
- discord_profile: The URL of the show's Discord profile page.
- instagram_profile: The URL of the show's Instagram profile page.
- title_tsvec: A tsvector representation of the show's title, used for text search indexing.
- audience_size: An estimate of the show's audience size.
- sentiment: A value between -1 and 1 representing the sentiment of the show's audience, calculated based on their social media activity.
- ad_cost: An estimate of the cost of advertising on the show.
- mvp_rank: The MVP (Minimum Viable Product) rank of the show, calculated based on its popularity, ratings, and other factors.
- is_active: A boolean flag that indicates whether the podcast is currently active.
- dtlastitunes_check: The timestamp of the last time the podcast's information was checked on iTunes.
- islistedon_itunes: A boolean flag that indicates whether the podcast is currently listed on iTunes.
- dtlastcheck: The timestamp of the last time the podcast's information was checked on PodcastDB.
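
The field list above can be put to work with a short loading script. The following is a minimal sketch, not an official client: it assumes a CSV export has a header row using the field names above, that boolean fields such as is_explicit are written as text like "true" or "false", and that totalaudioseconds is a plain number. None of these encoding details are confirmed by this document, and the sample rows are made up for illustration.

```python
import csv
import io

# Hypothetical two-row sample; a real export contains many more columns.
SAMPLE_CSV = """rss_url,title,episode_count,totalaudioseconds,is_explicit
https://example.com/feed.xml,Example Show,10,36000,true
https://example.org/rss,Another Show,0,0,false
"""


def parse_bool(value):
    """Interpret common textual boolean encodings (an assumption about the export)."""
    return value.strip().lower() in {"true", "t", "1", "yes"}


def load_podcasts(text):
    """Parse CSV text into a list of dicts with typed values for a few fields."""
    rows = []
    for raw in csv.DictReader(io.StringIO(text)):
        episodes = int(raw["episode_count"] or 0)
        total_seconds = float(raw["totalaudioseconds"] or 0)
        rows.append({
            "rss_url": raw["rss_url"],
            "title": raw["title"],
            "episode_count": episodes,
            "totalaudioseconds": total_seconds,
            "is_explicit": parse_bool(raw["is_explicit"]),
            # Derived value: mean seconds per episode, None when there are no episodes.
            "mean_episode_seconds": total_seconds / episodes if episodes else None,
        })
    return rows


podcasts = load_podcasts(SAMPLE_CSV)
for p in podcasts:
    print(p["title"], p["episode_count"], p["mean_episode_seconds"])
```

To read an actual export, pass the file's contents to load_podcasts in place of SAMPLE_CSV. The derived mean_episode_seconds value is computed here only as a cross-check on totalaudioseconds and episode_count; it is not claimed to match how avgepisodeduration is calculated.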