Issue 15.1

AudioVisual Data in DH

PodcastRE Analytics: Using RSS to Study the Cultures and Norms of Podcasting

  • Eric Hoyt
  • J.J. Bersch
  • Susan Noh
  • Samuel Hansen
  • Jacob Mertens
  • Jeremy Wade Morris

  • metadata
  • data visualization
  • data curation
  • digitization

    Introduction

    As a cultural form, podcasting resists easy definition. It is a highly porous medium, traveling with us over earbuds, phone speakers, and car stereos, accompanying us on commutes, jogs, errands, and road trips. It’s a sound-based media that we also experience visually through live shows, thumbnail icons, and t-shirts that say “Friend of the Pod” or “Night Vale Community College.” Despite these definitional challenges, the medium, by most measures, is booming — with the quantity of podcasts, listeners, advertising revenue, and non-profit funding increasing sharply year after year, including an “explosive” 2018, which saw the number of U.S. people over the age of twelve who have ever listened to a podcast climb above 50% for the first time 1 2.

    If we consider podcasts from a purely technical standpoint, it is possible to narrow the definition slightly. As we have elaborated elsewhere, a podcast can be defined as a collection of downloadable files, of any format, served, with accompanying metadata, via an open updatable internet feed, primarily RSS 3. An XML-based protocol, RSS allows for podcasters to easily publish their completed work and distribute it to audiences, who can opt to subscribe to particular feeds. In many ways, the metadata and the open feed are what separate a podcast from other media files on the internet, including other forms of on-demand audio (for example, music streaming platforms and audio book companies). Because RSS feeds are open, podcasting is platform-independent. Listeners can subscribe to feeds through a number of different podcatching apps and a variety of platforms. At least for now.

    Both the expansive cultural meanings of podcasting and the rigid technical definition have animated our work over the past three years on the PodcastRE database and our desire to study and preserve this emerging format. Based at the University of Wisconsin-Madison and supported by grants from the university and the NEH, PodcastRE (short for Podcast Research and accessible at http://podcastre.org) is a data preservation and research initiative. As we write in April 2020, the PodcastRE database has grown to over 2.5 million podcast episodes from over 16,000 unique RSS feeds which occupy 99 terabytes of space within our RAID storage array. The collection has expanded beyond what any individual could listen to within a lifetime, and it only keeps growing.

    What can researchers do with millions of podcast episodes and their associated metadata? This article seeks to contribute to the body of digital humanities scholarship invested in harnessing the affordances of digital technology to investigate cultural data at a large scale 4 5 6 7. We argue that podcasting’s unique technical affordances (e.g. RSS and metadata) open up productive methods for exploring the cultural practices and meanings of the medium. These methods, in turn, hold broader relevance for scholars seeking to integrate media studies with computational analysis (or, as the theme of this special issue nicely puts it, “AV in DH” ). Our study of podcasting shows the ways that metadata can function as a surrogate for studying large collections of time-based media objects. To put it simply, it’s far easier to query 2 million metadata records than it is to query 2 million media files of movies, TV episodes, or audio programs. Yet our study also shows that, when it comes to born digital media, the metadata are never fully separate from the objects they describe, nor can they fully describe, or replace the need for, returning to the media themselves during the final analysis. As a result, future work in AV in DH needs to thoughtfully delineate between methods best suited for digitized media collections compared to those most appropriate for born digital media collections .

    In this article, we share three different methods for analyzing the metadata of PodcastRE’s born digital corpus, assessing the strengths and weaknesses of each method and sharing preliminary results. First, as we will share, PodcastRE’s Term Frequency Line Graph (http://podcastre.org/lineGraph) allows researchers to create visualizations of trending topics and keywords. Interpreting the results of the line graphs can be challenging, however, due to the messiness of the underlying metadata and the problem of “normalizing” a rapidly growing corpus and medium. Second, PodcastRE’s Associated Keyword Cloud visualization tool (available at http://podcastre.org/wordCloud) enables researchers to query a keyword and generate a word cloud that displays the other keywords that appear alongside that keyword in podcasts. We argue that this data visualization harnesses the potential of the medium’s inconsistent and messy metadata and allows for open-ended explorations, serendipitous discovery, and new questions about the agency of podcasters in self-defining their cultural output and connecting it with particular communities and conversations. Third and finally, we share approaches for studying the duration of podcasts. As a time-based media format, podcasting is unusual in that it is not bound by the programming schedules and technical limitations that provide strict parameters for most audiovisual forms, such as movies, television, and radio. If a podcast can run anywhere from a couple of seconds to several hours in length, how do norms and common practices develop that establish optimal models for a podcast’s duration? To investigate these questions, we exported CSVs from the database (using a “mediaFileDuration” field generated by the individual episode files), sorted them into meaningful sample groups, and investigated the data for patterns.

    Ultimately, our goal for this article is to share research and methods for studying the explosion of audio culture taking place in podcasting and through the sonic communities and conversations podcasting draws together. These methods are especially well suited for studying audio, but they would also be valuable for exploring online video collections and other digital media objects. As in any research study, though, it’s important to address the specific before offering broader generalizations. With that in mind, we would like to now turn to a consideration of the history and design of RSS feeds, how they inform the underlying dataset (the PodcastRE collection), and the affordances and challenges of these structures.

    RSS Feeds, the PodcastRE Collection, and Working with Messy Metadata

    As the protocol that has enabled inconsistent and idiosyncratic podcast metadata to proliferate across the internet, it is fitting that there is no singular consensus on what the initials “RSS” should stand for. Real Simple Syndication is the most commonly cited meaning. But “Rich Site Summary” and “Resource Description Framework (RDF) Site Summary” have also been cited as the basis of the name. There is no question, however, that the technology has played a pivotal role in the growth of podcasting and infrastructure of PodcastRE.

    On March 15, 1999, Netscape published the first specification for RSS 8. Based on XML and developed by Ramanathan Guha and Dan Libby, RSS was created so that the Netscape home page “My Netscape” could be refreshed with new content from webpages which used the specification 9. Over the next couple of years, RSS went through multiple iterations, and then on December 25, 2000, Dave Winer and UserLand software released RSS 0.92 8. It was this version of RSS which is most important in the history of podcasting as it was the first version which included the tag, which allowed for the attachment of media files. Concocted by Winer, with strong prompting from Adam Curry, as a way to deliver high quality multimedia files over the internet without the quality and wait time issues which plagued early streaming, the first use of an was to distribute a set of Grateful Dead MP3 files 10, presaging its dominant use in the years to come.

    RSS continued to develop for the next decade and mostly stabilized as a specification with version 2.0.11 on March 30, 2009 8. Since then the only updates to the format have come in the form of XML Namespaces, which are ways of adding outside-of-specification elements to XML documents that are commonly used by Apple, Google and other podcast distributors to expand metadata options for commercial purposes 11 12. Even as podcasting apps, playback technologies, and the on-demand sound industry has changed throughout the 2010s, the basic structure and syntax of RSS has remained constant, keeping the circulation of most podcasts relatively open and freely downloadable.

    The open infrastructure of RSS also became foundational for our work on PodcastRE 13. Interested in studying podcasts, but worried about the vulnerability of digital audio files, we realized in early 2014 that there were few searchable databases of podcasts for studying and analyzing the booming audio culture taking place in podcasting. We began rather humbly by logging RSS feeds manually in iTunes and downloading audio files to a local hard drive, tracking as best we could podcasts that were being cited in the press as part of the renewed interest in podcasting, like Serial or Welcome to Nightvale 14. As the project grew, we implemented a more coherent collection process, and since 2018, we have been saving podcasts included in discussions of podcasting’s “golden age” as well as interrogating what podcasts are being left out of that discussion. We’ve navigated the need to preserve the “popular” by automating the collection of a particular index of what’s popular: the Apple Podcasts top 100 lists for the U.S., Great Britain, France, and Australia every 24 hours. This automated approach toward collecting embraces both the affordances of the digital media and the MPLP (More Product, Less Process) model proposed by Mark A. Greene and Dennis Meissner 15. Our efforts to identify and collect significant podcasts beyond the Apple Podcasts top 100 have been driven by collaborations with scholars who are researching independent podcasts produced by women, indigenous peoples, and people of color 16 17 and by following the work of other networks, directories and databases devoted to highlighting marginalized/less visible podcasts (Podcasts in Color, Women In Podcasting, PotLuck Podcast Network, etc).

    PodcastRE’s collection of 2 million podcast episodes has thus been built by a combination of algorithmic methods and informed hand-selections. There’s also a “submit a podcast” feature on the project’s website that allows individuals to add the RSS feeds for podcasts they’d like preserved. The 16,000 archived podcast feeds are a fraction of the over 1,000,000 podcast feeds that, according to estimates, are currently being distributed 18. But the PodcastRE collection does offer a valuable and diverse cross-section of English-language podcasts from the past several years.

    The common thread through all of this work has been RSS. To put it simply, if a podcast doesn’t have an RSS feed, then we cannot yet preserve it within our system. This is one of the reasons why the technical definition of a podcast — an open feed of downloadable files and associated metadata — has been so important to our work on PodcastRE. To achieve our goals of scale, not only did we need to be able to download podcast episode files automatically, we also needed to gather the metadata we could store automatically. For PodcastRE, the elements available through the RSS specification, and its associated namespaces, are as important as the podcast episode files themselves. RSS, in other words, defines the possible universe of metadata for the podcasts archived in PodcastRE.

    What we did not immediately appreciate was how messy, idiosyncratic, and incomplete the world of podcasting metadata would prove to be. Podcast RSS metadata is a world away from the familiar and relatively consistent metadata fields of TEI and Dublin Core. One reason is the relatively sparse number of elements which are required for a feed to be valid. In fact, an RSS feed only needs four elements to be present in order to be valid: the parent element with associated, <link>, and <description> elements. This would be a feed without content though as it would contain no <item>s <sup id="fnref:19"><a href="#fn:19" class="footnote-ref" role="doc-noteref">19</a></sup>. Because authors fully manage their own RSS feeds, and the entry of the metadata into them, they are directly responsible for the depth and quality of the metadata. This aspect of podcast metadata cannot be stressed too highly. With the exception of a few elements like <a href="googleplay:category">googleplay:category</a> and <a href="itunes:type">itunes:type</a>, there are almost no constraints on what podcast authors put into the various elements. Even fixed format elements like <pubDate>, which seems rather self explanatorily to mean the date on which a podcast episode was published into a feed, can end up being used by authors to mean something very different. For example, there are many <pubDate>s before 1950 in the metadata for <em>The Reith Lectures</em> podcast from the BBC, long before the term podcast was ever coined. Instead, the series uses <pubDate> to mean the day the lecture was originally given. RSS authors continue to have the authority to change anything they wish — including something as fundamental as the title of an episode, or even their whole podcast, at any time. Looking in PodcastRE, we see examples related to branding, as when <em>Bookworm</em> added their network and became <em>KCRW’s Bookworm</em> ; or to SEO, as when <em>Highest Self Podcast</em> added some terms and turned into <em>Highest Self Podcast: Modern Spirituality, Ayurveda, Conscious Entrepreneurship, Mind-Body Balance</em> .</p> <p>The inconsistent and incomplete metadata records created major challenges for our efforts to systematically preserve podcasts and make them easily searchable. We found it especially unfortunate that metadata fields that could have been revelatory for search faceting and social network analysis (fields such as <network>, <host>, and <contributor>) are not a part of any current podcast RSS specifications. Yet it was equally clear that authoritative approaches to metadata had their own problems and major blind spots. The inadequacies and biases of Library of Congress subject headings have received considerable attention within the discipline of information studies. For example, Juliet L. Hardesty <sup id="fnref:20"><a href="#fn:20" class="footnote-ref" role="doc-noteref">20</a></sup> has argued that the subject headings generally take the primacy of white men as a default; “Robert Frost” is cataloged under “Poets, American” without reference to gender or race, whereas Maya Angelou is listed under subjects including “African American women authors” and “African American authors.” The catalogers, in these cases, are applying a schema that upholds a white patriarchal worldview and minimizes both the needs of users and the ways in which creators and subjects would choose to define themselves.</p> <p>In contrast, podcast creators have a tremendous amount of agency in how they define themselves and attempt to connect with users (i.e. listeners and audiences). When the creators of the PHX podcast entered the keywords “podsincolor” and “women of color” within their RSS feed, they actively chose to present themselves this way and place their work within a larger network of podcasts produced by people of color. The flexibility that characterizes metadata practices prove to be critical for marginalized podcasters in forming community, as they seek to carve out space for themselves within media production practices and platforms that consistently privilege hegemonic whiteness, accepted paradigms of masculinity, and heteronormativity. While this does not necessarily mean that self-policing within metadata production does not happen as a result of the asymmetrical power dynamics between platform and creator, it does still provide yet another avenue in which marginalized communities can stand in opposition to the individualistic neoliberal ideologies that undergird contemporary user-driven media production <sup id="fnref:21"><a href="#fn:21" class="footnote-ref" role="doc-noteref">21</a></sup> <sup id="fnref1:17"><a href="#fn:17" class="footnote-ref" role="doc-noteref">17</a></sup>. It is critical to note, however, that the non-uniformity of metadata production yields ambivalent practices, both where innovative podcasters can resist the influence of various dominant ideologies, while others use this space to reinforce their centrality simultaneously.</p> <p>For example, there are also many instances of podcasters stuffing their RSS with keywords in order to make them prominent within content aggregators and “podcatchers.” The internet abounds with advice and speculations for search engine optimization and strategies that can be utilized in order to gain attention to one’s content, such as the optimal number of keywords, the kinds of thumbnail images that should be connected to content, and more <sup id="fnref:22"><a href="#fn:22" class="footnote-ref" role="doc-noteref">22</a></sup> <sup id="fnref:23"><a href="#fn:23" class="footnote-ref" role="doc-noteref">23</a></sup>. The opaque nature of how Apple Podcasts organizes its search results impacts the manner in which metadata is written, and this influences the ways that podcast creators self-define their own content. The dominating influence of Apple Podcasts categories can be observed by the fact that within the entirety of the PodcastRE database, the most used keywords lists are dominated by terms that are outlined either fully or in part by Apple Podcasts genre specifications. For example, with the exception of the words, “podcast” and “radio,” the top fifteen keywords for the podcast classification (the entire podcast series) terms all cohere to various genre classifications within Apple Podcasts. Similar patterns can be seen for the episodic classification where, with the exception of “Talk Radio,” “Podcast” and a blank space/uncategorized, the top ten keywords reflect Apple Podcasts categories. The large amount of uncategorized keyword terms may gesture towards the fact that after 2013, the keywords metadata field became deprecated, meaning that it no longer affected the output of Apple Podcasts’ search engine algorithms (<a href="https://support.libsyn.com/kb/the-rss-feed/">https://support.libsyn.com/kb/the-rss-feed/</a>). After this discovery, many podcasters may have forgone the labor of adding keywords, as the fields that most influence search engine optimization are now the title, author, and description tags.</p> <p>Even though Apple Podcasts deprecated keywords within its search algorithm, we became excited about the role keywords could play for our work on PodcastRE. What sort of data visualizations and discoveries might be possible by harnessing RSS metadata at scale? Ultimately, we built two data visualizations for the site. Perhaps not surprisingly, the more successful of the two was the one that most embraced the idiosyncratic, messy, and user-created nature of RSS.</p> <h2 id="graphing-metadata-term-frequency-across-time">Graphing Metadata Term Frequency Across Time</h2> <p>How do keywords and other fields used to describe podcasts change over time? Could tracking these changes prove useful for spotting trending topics within the podcasting ecosystem? To explore these questions, we created PodcastRE’s Term Frequency Line Graph (publicly available at <a href="https://podcastre.org/lineGraph">https://podcastre.org/lineGraph</a>), which tracks the frequency across time that any word or phrase within the metadata appears. The fields searched include the title, creator, synopses, and keywords. A visualization graphing the term “money” within PodcastRE is displayed below. If a user clicks on any point within the graph, their browser opens up a new tab displaying all of the matching podcast episodes from that month or year that contain a matching search term.</p> <p>The Term Frequency Line Graph searches metadata included within individual podcast episodes and across the entire feeds (for example, while “NBA” may be a keyword that describes a podcast feed as a whole, “China” may be a keyword that describes a topic discussed within one episode of the podcast). By default, the X-axis of the graph is divided by years; however, users can toggle to a monthly scale. This allows for researchers to see when certain topics or keyword phrases spike on a seasonal cycle (for example, “baseball” consistently has an uptick during the playoffs every October) versus more macro-scale trends that rise and fall over a period of years.</p> <figure ><img loading="lazy" alt="" src="/dhqwords/vol/15/1/000519/resources/images/figure01.png" sizes="(max-width: 768px) 100vw, 80vw" srcset="/dhqwords/vol/15/1/000519/resources/images/figure01_huc39da1dd63b871ccf7db8e653a5ab7aa_101684_500x0_resize_box_3.png 500w, /dhqwords/vol/15/1/000519/resources/images/figure01_huc39da1dd63b871ccf7db8e653a5ab7aa_101684_800x0_resize_box_3.png 800w,/dhqwords/vol/15/1/000519/resources/images/figure01_huc39da1dd63b871ccf7db8e653a5ab7aa_101684_1200x0_resize_box_3.png 1200w,/dhqwords/vol/15/1/000519/resources/images/figure01.png 1200w" class="landscape" ><figcaption> <p>Term Frequency Line graph of “money” , tracked over time, within PodcastRE’s corpus. Tool is available at <a href="http://podcastre.org/lineGraph">http://podcastre.org/lineGraph</a>. </p> </figcaption> </figure> <p>When researchers use the Term Frequency Line Graph to look for trends across a span of years, however, they quickly encounter an interpretive challenge: almost any term they search will appear to dramatically increase in 2017 and 2018. This is because the PodcastRE collection grew exponentially over those two years, a result of the growth in the podcasting ecosystem as a whole and our own curatorial decision to automatically preserve any feed that appears on the Apple Podcasts Top 100 chart in the U.S., U.K., Australia, or France. While we give users the ability to “normalize” the graph results (which employs an equation to account for the larger number of podcasts from some years compared than others), we know this feature has its limits. What does it mean to “normalize” the number of podcasts during a period in which the medium is rapidly evolving?</p> <p>We have tried to address this interpretive challenge through a “Rate of Episodes Added” button, which provides contextualization in regard to the database itself. By showing how many episodes are added per year, users can see how the rate of growth in the database can affect the numbers that are being shown for any query’s term frequency. Additionally, the “Area Graph” button transforms the data into a stacked graph, which allows for comparisons across multiple queries at particular moments in time and reminds users that the graphs are malleable and dynamic. Finally, the user can move to a more granular level at any point by clicking on a point in the graph, allowing them to investigate the actual podcast feeds and episodes that appear as abstractions within the graph. Users can save the data to a CSV file, a JPG, PNG or SVG vector image, so that this data can be applied to a variety of presentational contexts.</p> <p>In many ways, PodcastRE’s Term Frequency Line Graph exemplifies the limitations digital humanists are likely to encounter when applying data visualizations built for <em>digitized text collections</em> to <em>born-digital media collections</em> . We modeled the user-experience and technological framework of PodcastRE’s Term Frequency Line Graph on that of the Arclight app (<a href="http://projectarclight.org">http://projectarclight.org</a>), which searches the 2.5 million page corpus of the Media History Digital Library (MHDL) <sup id="fnref:24"><a href="#fn:24" class="footnote-ref" role="doc-noteref">24</a></sup>. The MHDL is composed of books and magazines pertaining to the histories of film, broadcasting, and recorded sound from 1915 to 1960, which is an especially robust period for the searching of named entities (such as people, film titles, or radio station call letters). Additionally, the normalization function for Arclight graphs works quite well (the most represented year of 1915-1960 is only double in size of the least represented year, avoiding PodcastRE’s challenge of grappling with exponential growth). Normalized searches for the names of movie stars, for example, generally map onto the arcs of their popularity and/or notoriety, sometimes, though not always, with surprising results. Data visualizations built for searching entities within large corpora of digitized texts are less adept at producing immediately legible results for searching the metadata keywords of a rapidly growing born-digital medium. What would it mean to design a data visualization tool that embraced the messiness of born-digital objects and their metadata, rather than trying to smooth them out?</p> <h2 id="associated-keyword-word-cloud">Associated Keyword Word Cloud</h2> <p>In developing PodcastRE’s Associated Keyword Word Cloud, we sought to harness and foreground the specificities and idiosyncrasies of born digital media collections. This data visualization takes the keywords that podcasters entered to describe their work and puts them into conversation with other podcasters’ keywords. A specific example is helpful for understanding how it works.</p> <p>Using the keyword “money” , in a search conducted in the fall of 2019, we found the term appeared in the metadata of 68,619 podcast episodes saved within PodcastRE, collected from 587 discrete RSS feeds. The other keywords that appear most frequently along with “money” in podcast metadata are visualized below (see <a href="#figure02">Figure 2</a>). This visualization includes predictable matches within the popular financial self-help genre (e.g. “wealth,” “business,” “entrepreneur” ), as well as meaningful intersections that lay outside financially-oriented podcasts (e.g. “spirituality,” “Relationships & Sex,” “Fear” ). When a user clicks on the keyword value in the cloud, the user is immediately transferred to the PodcastRE database interface, where it shows all of the podcasts that used these paired keyword values. <a href="#figure03">Figure 3</a> reveals the results of the podcasts that contain both the keywords “money” and “spirituality.” The process promotes serendipitous discovery and may lead the researcher toward encounters they hadn’t anticipated. For example, modern witchcraft is better represented in the podcasts with “money” and “spirituality” as keywords than most traditional forms of organized religion.</p> <figure ><img loading="lazy" alt="" src="/dhqwords/vol/15/1/000519/resources/images/figure02.png" sizes="(max-width: 768px) 100vw, 80vw" srcset="/dhqwords/vol/15/1/000519/resources/images/figure02_hu3e21c030c9b47dc14788eec1edc04fcc_293842_500x0_resize_box_3.png 500w, /dhqwords/vol/15/1/000519/resources/images/figure02_hu3e21c030c9b47dc14788eec1edc04fcc_293842_800x0_resize_box_3.png 800w,/dhqwords/vol/15/1/000519/resources/images/figure02.png 975w" class="landscape" ><figcaption> <p>Word cloud for the query, “money,” on the “All Keywords” category. Taken using the Associated Keyword Cloud visualization tool at <a href="http://podcastre.org/wordCloud">http://podcastre.org/wordCloud</a>. </p> </figcaption> </figure> <figure ><img loading="lazy" alt="" src="/dhqwords/vol/15/1/000519/resources/images/figure03.png" sizes="(max-width: 768px) 100vw, 80vw" srcset="/dhqwords/vol/15/1/000519/resources/images/figure03_hu984dfb871bdc9e9a5abfe8172e558e41_251276_500x0_resize_box_3.png 500w, /dhqwords/vol/15/1/000519/resources/images/figure03_hu984dfb871bdc9e9a5abfe8172e558e41_251276_800x0_resize_box_3.png 800w,/dhqwords/vol/15/1/000519/resources/images/figure03.png 974w" class="landscape" ><figcaption> <p>Screen shot of the results page for podcast episodes containing the keywords “money” and “spirituality.” </p> </figcaption> </figure> <p>The Associated Keyword Cloud visualization was built through connecting together multiple open source technologies. Like the Term Frequency Line Graph, the Associated Keyword Word Cloud uses the Highcharts Javascript library to animate the visualization. To retrieve the information it needs, we query the keyword metadata facet within PodcastRE’s Solr search index, and we return and store them as key value pairs, with the number of podcasts that maintain both the queried keyword and the additional keyword (the hit count) next to the particular word. For example, if a user queries the word “love” within the database, a potential key value pair that would appear would be “[ relationships , 163],” where “relationships” would be the associated keyword for “love,” and the “163” stands for how many times this keyword was added alongside the word “love.” The results are sorted through keywords that have the most hits down to the associated keywords that have the least hits. By targeting this metadata keyword field and assigning the “weight” of a word to be the number that is assigned to the hit count of the key value pair, we were able to visually represent which keywords were paired most often with the queried word, by making the word with the heaviest “weight,” the largest in the word cloud. Because certain topics have a range of associated keywords that spanned hundreds of words, we limited the number of keywords that can be shown on the word cloud to a maximum of 200 words. While this decision may hinder researchers from getting the full range of associated keywords, this limitation was imposed to ensure readability on the visualization. Two hundred words seemed like a reasonable count in order for researchers to gain a sense of the wide range of relational topics that podcasters were dealing with, and simultaneously allow the visualizations to be effective in showing which keywords were the most actively engaged with.</p> <p>There are two options on the Associated Keyword Word Cloud interface that aid in isolating whether the keywords shown are related to podcasts in their entirety or exclusive to certain episodes. Additionally, if users want a merging of these two levels of metadata, they can search across both podcast and episode keywords by using the “All Keywords” option. In this manner, for podcasts that may deal with a wide range of topics, such as news or current events podcasts, there can be a closer examination on a micro episodic level of what kinds of keywords are used to define certain topical content. Often, the keywords that are used to describe podcasts are not uniformly applied to define episodes, so providing these two levels of analytical range gives researchers more flexibility in the kinds of questions they can ask using PodcastRE.</p> <p>All attempts to interpret the Associated Keyword Word Clouds ultimately lead back to reflecting on the practices, norms, aspirations, and communities of the podcasters themselves.</p> <p>As discussed earlier, keywords allow content creators to define their work to listeners and podcatcher applications. They are a space of creator agency, where podcast producers deploy keywords to create networks of ambient affiliation with other podcasts and subject matter. By making one of PodcastRE’s database visualization tools intimately connected to these creator-defined keywords and their relationships to other keywords, we provide an alternative mode of discoverability apart from the algorithms that govern commercial aggregators such as Apple Podcasts. In this manner, PodcastRE hopes to provide a different approach that foregrounds creator agency and their interactions with their own metadata through the digital archive’s organization, particularly with these metadata visualizations.</p> <h2 id="studying-the-durations-of-podcasts">Studying the Durations of Podcasts</h2> <p>The Term Frequency Line Graph and Associated Keyword Word Cloud can both be effectively applied toward exploratory research and achieving serendipitous discoveries. But we also wanted to use PodcastRE and the “mediaDuration” field to examine a more focused question. What patterns can we notice about the duration of podcasts, and what can they tell us about practitioner norms and assumptions of what makes for a good length of a podcast? Unlike most other AV forms — movies, television, and radio — podcasts are a time-based medium that are not constrained by programming schedules (broadcast schedules, movie theater showtimes) and technical limitations (reels of film and tape). If a podcast could run anywhere from a couple of seconds to several hours in length, how do norms and common practices develop around perceived ideas of a podcast’s optimal duration? We realized that metadata could help us answer this question.</p> <p>In this section, then, we propose and share two approaches to studying podcast duration. First, we consider how duration analysis might clarify the differences between two programs of the same specification classification, in this case two popular daily programs from <em>The New York Times</em> and <em>NPR</em> , using data gathered from episodes ranging from the former’s launch in early 2017 to an end point of April 2018. Second, we conduct an investigation of a much larger scale, analyzing large rosters of programs to juxtapose duration across networks and genres. Our case studies here are the comedic programs of Earwolf and the comparably more serious fare of Gimlet Media, using data gathered from episodes ranging from 2009 until early 2018. In both of these cases, the statistics were gathered by first running an SQL query on the PodcastRE database, then exporting metadata for all of the episodes into a .csv file, and finally finding averages, medians, and other numbers using Microsoft Excel. All of these approaches required us to assemble subsets of data from within the PodcastRE collection (and the .csv files), rather than treating the entire collection as a dataset.<sup id="fnref:25"><a href="#fn:25" class="footnote-ref" role="doc-noteref">25</a></sup> The genre and network categories that we ourselves added to the spreadsheets opened the data up for more meaningful analysis, especially when paired with the duration metadata provided by the RSS feeds.</p> <p>Our first approach to studying duration explored what has become one of the most popular contemporary podcast formats: the daily news program. How long should a daily news podcast take to consume? When <em>The New York Times</em> launched <em>The Daily</em> in February of 2017, host Michael Barbaro described the fledgling program thusly: “This is how the news should sound. Fifteen minutes a day, five days a week. It isn’t quite a podcast — although you can listen wherever you listen to podcasts. It isn’t quite the radio — although the mechanics are largely the same. It isn’t quite the newspaper — although we’ll be drawing heavily on the journalism that powers The New York Times” <sup id="fnref:26"><a href="#fn:26" class="footnote-ref" role="doc-noteref">26</a></sup>. Though Barbaro pegged the program as difficult to explain, it was a nearly immediate hit, gaining over five million monthly listeners by July of 2018 <sup id="fnref:27"><a href="#fn:27" class="footnote-ref" role="doc-noteref">27</a></sup>. As Barbaro told <em>Vanity Fair</em> that same month, “When we started the show, we had many goals… We didn’t realize we were going to make money that was actually going to get pumped back into the company” <sup id="fnref:28"><a href="#fn:28" class="footnote-ref" role="doc-noteref">28</a></sup>. Yet as is often the case, success breeds imitators and competitors, and <em>The Daily</em> witnessed the rise of its biggest challenger in June of 2017 when NPR launched <em>Up First</em> , a daily “10-minute morning news podcast” that is “designed with digital listeners in mind but will also serve as a preview of the news stories that will be treated in depth on public radio stations across the country throughout the day” <sup id="fnref:29"><a href="#fn:29" class="footnote-ref" role="doc-noteref">29</a></sup>. That program was also a swift triumph, and as of October 2018, both <em>The Daily</em> and <em>Up First</em> sat comfortably in the top five most popular podcasts according to Podtrac’s rankings: the former tailed behind only <em>Serial</em> , while the latter occupies the fifth spot <sup id="fnref:30"><a href="#fn:30" class="footnote-ref" role="doc-noteref">30</a></sup>.</p> <p>Episode duration has been a central selling point for each of the two podcasts. As seen above, both of the series’ launch press releases mention episode length. Descriptions of the programs on their official websites also focus on duration. <em>Up First</em> has remained consistent in its advertised average runtime: “NPR’s <em>Up First</em> is the news you need to start your day. The biggest stories and ideas — from politics to pop culture — in 10 minutes” <sup id="fnref:31"><a href="#fn:31" class="footnote-ref" role="doc-noteref">31</a></sup>. <em>The Daily</em> , meanwhile, has added five minutes to its initial announcement: “This is how the news should sound. Twenty minutes a day, five days a week, hosted by Michael Barbaro and powered by New York Times journalism” <sup id="fnref:32"><a href="#fn:32" class="footnote-ref" role="doc-noteref">32</a></sup>. The programs are, essentially, two different approaches to the morning commute: <em>Up First</em> ’s proposed shorter length seems guaranteed to slot into almost any daily trip to work, while <em>The Daily</em> ’s longer runtime requires either a lengthy commute, multiple listening sessions, or even perhaps the utilization of 1.5x or 2x speed playback options. Such duration decisions are complimented by storytelling approaches: <em>Up First</em> ’s short length is matched with a “greatest hits” style compilation of short stories, while <em>The Daily</em> ’s relatively lengthier duration is primarily spent on the discussion of a single story. In theory, then, the former aims to quickly provide its listeners with headline-style blurbs about the day’s biggest stories, while the latter seeks to exhaustively cover a single topic.</p> <p>Such temporal differences are roughly borne out by the metadata found in PodcastRE’s database, although the story is more complicated than the descriptions of the series imply. <em>The Daily</em> (mean duration of 22:51, median duration of 22:12) runs nearly ten minutes longer than <em>Up First</em> (mean duration of 13:33, median duration of 13:17), with both programs on average running a few minutes longer than their advertised lengths. The differences between the two series is much starker when considering the range in podcast durations, as <em>Up First</em> is relatively consistent in episode duration (shortest episode of 11:01 and longest episode of 17:46 for a range of 6:45) while <em>The Daily</em> varies widely between episodes (shortest episode of 13:00 and longest episode of 41:23 for a range of 28:23). These durational differences align neatly with the programs’ content choices (i.e. multiple headlines vs. single story focus), though they provide critical additional clarifications. While both <em>Up First</em> and <em>The Daily</em> release episodes each weekday morning, the former’s tight range and shorter length ties it more closely to its proposed function as morning commute listening, while the latter’s wider range and extended runtime emphasizes delivering a full story adequately. Since podcasts do not have the same durational constraints of broadcast media, these choices in runtime are clear aesthetic and storytelling decisions – yet given the evolutionary radio approach of NPR’s daily podcast and <em>The New York Times</em> ’s commitment to the news story, these decisions are not completely detached from their companies’ original mediums.</p> <p>On a larger scale, podcast duration analysis can point towards divergent approaches by podcast networks and in certain genres. As an example, we conducted an analysis of thirty Earwolf programs<sup id="fnref:33"><a href="#fn:33" class="footnote-ref" role="doc-noteref">33</a></sup> and nineteen Gimlet Media programs.<sup id="fnref:34"><a href="#fn:34" class="footnote-ref" role="doc-noteref">34</a></sup> The former describes itself as “the leading comedy podcast network devoted to creating the best, funniest, and most entertaining podcast shows in existence” <sup id="fnref:35"><a href="#fn:35" class="footnote-ref" role="doc-noteref">35</a></sup>. Gimlet Media specializes in more “serious” fare, characterizing itself as “the award-winning narrative podcasting company that aims to help listeners better understand the world and each other” <sup id="fnref:36"><a href="#fn:36" class="footnote-ref" role="doc-noteref">36</a></sup>. Though both companies employ personnel who have worked or continue to work in other mediums, Earwolf and Gimlet distinguish themselves from other major podcast networks such as <em>NPR</em> , <em>iHeartRadio</em> , and <em>WNYC Studios</em> through their podcast nativism: both companies began as strictly podcast-focused networks rather than emerging within older media companies.</p> <p>Perhaps as a result, the two networks have markedly different approaches towards podcast episode length. Of the 30 surveyed Earwolf programs, 3 have average runtimes between 0-20 minutes, 4 have average runtimes between 20-40 minutes, 6 have average runtimes between 40-60 minutes, 10 have average runtimes between 60-80 minutes, 3 have average runtimes between 80-100 minutes, 3 have average runtimes between 100-120 minutes, and 1 has an average runtime between 120-140 minutes (See <a href="#figure04">Figure 4</a>). This means that over half of the surveyed programs have average episode durations over an hour, with programs ranging from <em>Eardrop</em> ’s average runtime of 3:17 and <em>Never Not Funny</em> ’s average runtime of 2:05:33. The shortest single episode was a 38-second <em>Eardrop</em> episode, while the longest individual episode was a <em>Comedy Bang! Bang!</em> that lasted 3:19:02. Earwolf’s individual shows also frequently feature drastic ranges in shortest and longest duration: <em>Hollywood Handbook</em> , for instance, has a range of 1:53:27 between its shortest and longest episodes, while <em>Comedy Bang! Bang!</em> ’s range is 2:39:06.</p> <figure ><img loading="lazy" alt="" src="/dhqwords/vol/15/1/000519/resources/images/figure04.png" sizes="(max-width: 768px) 100vw, 80vw" srcset="/dhqwords/vol/15/1/000519/resources/images/figure04_hu9fad66afa02ed2edfa029cba9f2f7dde_1455849_500x0_resize_box_3.png 500w, /dhqwords/vol/15/1/000519/resources/images/figure04_hu9fad66afa02ed2edfa029cba9f2f7dde_1455849_800x0_resize_box_3.png 800w,/dhqwords/vol/15/1/000519/resources/images/figure04_hu9fad66afa02ed2edfa029cba9f2f7dde_1455849_1200x0_resize_box_3.png 1200w,/dhqwords/vol/15/1/000519/resources/images/figure04_hu9fad66afa02ed2edfa029cba9f2f7dde_1455849_1500x0_resize_box_3.png 1500w,/dhqwords/vol/15/1/000519/resources/images/figure04_hu9fad66afa02ed2edfa029cba9f2f7dde_1455849_1800x0_resize_box_3.png 1800w,/dhqwords/vol/15/1/000519/resources/images/figure04.png 2100w" class="landscape" ><figcaption> <p>Average durations of 30 surveyed Earwolf programs. </p> </figcaption> </figure> <p>Gimlet, on the other hand, is much more consistent in its runtimes across series, though there is still variation between individual episodes. Of the 19 surveyed programs, only 1 had an average runtime between 0-20 minutes, and that program ( <em>Chompers</em> ) serves a specific and brief function: children are meant to listen to the series as they brush their teeth. 2 of the series had an average runtime between 40-60 minutes, though both of those shows ( <em>Twice Removed</em> and <em>Mystery Show</em> ) are no longer producing episodes. The other 17 series, then, had average run times between 20-40 minutes, aligning Gimlet’s roster with conventional advice on podcast episode length.<sup id="fnref:37"><a href="#fn:37" class="footnote-ref" role="doc-noteref">37</a></sup> While individual episodes still varied quite a bit ( <em>Reply All</em> , for instance, had a range of 1:35:29 between its shortest and longest episodes, while <em>Mogul</em> had a range of 1:16:17), these ranges were still much smaller than the largest Earwolf ranges.</p> <figure ><img loading="lazy" alt="" src="/dhqwords/vol/15/1/000519/resources/images/figure05.png" sizes="(max-width: 768px) 100vw, 80vw" srcset="/dhqwords/vol/15/1/000519/resources/images/figure05_hu96728d905567300e6bd6470df4face79_1417668_500x0_resize_box_3.png 500w, /dhqwords/vol/15/1/000519/resources/images/figure05_hu96728d905567300e6bd6470df4face79_1417668_800x0_resize_box_3.png 800w,/dhqwords/vol/15/1/000519/resources/images/figure05_hu96728d905567300e6bd6470df4face79_1417668_1200x0_resize_box_3.png 1200w,/dhqwords/vol/15/1/000519/resources/images/figure05_hu96728d905567300e6bd6470df4face79_1417668_1500x0_resize_box_3.png 1500w,/dhqwords/vol/15/1/000519/resources/images/figure05_hu96728d905567300e6bd6470df4face79_1417668_1800x0_resize_box_3.png 1800w,/dhqwords/vol/15/1/000519/resources/images/figure05.png 2100w" class="landscape" ><figcaption> <p>Average durations of 19 surveyed Gimlet programs. </p> </figcaption> </figure> <p>The relative homogenization of Gimlet Media podcast duration, then, stands in stark contrast to the diverse podcast lengths of Earwolf. Every Gimlet Media podcast had an average run time under an hour, and 84.2% of the shows surveyed had average runtimes between 20-40 minutes. 56.7% of the Earwolf podcasts analyzed, meanwhile, had average runtimes over an hour long, and 76.7% of Earwolf’s podcasts had average runtimes over 40 minutes long — in other words, over three-quarters of Earwolf’s shows ran longer on average than Gimlet’s “sweet spot.” Individual episode lengths varied in each of the networks’ programs, but Gimlet’s programs featured smaller ranges than the large variation found in many of Earwolf’s programs. Such differences may be the result of institutional decisions, generic divergences, or series lengths. Whatever the cause, however, Earwolf and Gimlet serve as evidence that podcast networks can have wildly divergent approaches towards episode duration, and that studying duration can lead us to insights about genre conventions, production values and more.</p> <p>On a recent episode of the <em>Start Up</em> podcast, the show’s host, and Gimlet CEO, Alex Blumberg was reflecting on his decision to sell Gimlet media to Spotify. He noted that Gimlet’s gambit to standardize the production of highly edited and tightly produced ‘quality’ podcasts (that often followed very specific duration and other editorial decisions) had turned out to be a financially unfeasible strategy that was losing ground to cheaper and more popular chat cast style podcasts (where duration and other attributes are more flexible given the lower costs involved for editing and polishing the finished piece). His comments are a reminder that, despite the format’s substantial growth in the last two decades, there are still many lingering questions about the forms, conventions and economics of podcasting. We believe it is especially crucial during this time of flux, before podcasting stabilizes like so many other media have, to study the different approaches podcasters of all types are taking as they experiment with this emerging sonic format. Although duration numbers seem like relatively innocuous or descriptive metadata, the research from PodcastRE suggests they reveal historical relationships between new and old media formats, industrial and economic assumptions about “ideal” formats, and generic conventions that shape both amateur and professional podcasts.</p> <h2 id="conclusion">Conclusion</h2> <p>Our work on PodcastRE has aimed to provide tools and data that account for podcasting’s complexity as a cultural form while simultaneously taking advantage of its unique technical affordances. The centering of RSS metadata and what can be mined from it through advanced search, graphing keywords over time, or visualizing word clouds of associated keywords has helped us facilitate the automated collection of a significant corpus of podcasts from a crucial period in the format’s emergence. It has also facilitated novel, fine-grained exploration of specific file characteristics as well, like duration metrics, across a variety of genres and shows.</p> <p>The reliance on RSS, however, has also forced us to confront the messiness and intricacies of a born digital object whose metadata and descriptive features are dynamic and podcaster generated. Podcasting’s relatively open and accessible origins have helped create a vibrant environment for web-based audio – one that includes the scores of podcasts available and the multiplicity of voices behind them, but also the numerous apps, aggregator sites and distribution technologies that have emerged to support podcasting’s rise. RSS and XML have not only been important to our work on PodcastRE, but to podcasting more broadly, and to the agency and control it has provided podcasters for defining their work on their own terms as well as for listeners in terms of defining their listening practices. Recently, there have been a number of attempts to move away from the more open and accessible versions of podcasting, to more closed and profitable models (e.g. exclusive shows tied to one platform, like Spotify, or subscription-based services like Luminary). While these options may make podcasting more user-friendly and convenient, or may offer podcasters more options for monetizing their work, they also make podcasts more platform-dependent, less analyzable, and less open to research. </p> <p>The centrality of RSS to both podcasting and PodcastRE has been a theme throughout this article. We believe our methods and findings, however, hold relevance for beyond scholars researching other topics at the intersection of media studies and DH. As this study has shown, metadata records can serve as surrogates for studying large collections of time-based media objects, allowing researchers to query the durations of millions of media objects in a fraction of the time it would take to ingest and analyze transcoded media files. Yet our work has also shown that, when it comes to born digital media, the metadata are never fully separate from the objects they describe, nor are they fully capable of replacing close listening and other media studies methods. There is a need to delineate between methods best suited for <em>digitized media collections</em> compared to those most appropriate for <em>born digital media collections</em> and for devising strategies to blend AV and DH methods. By making these distinctions, we can better apply DH to AV and identify change and continuity, at a large scale, across media history.</p> <div class="footnotes" role="doc-endnotes"> <hr> <ol> <li id="fn:1"> <p>Edison Research, “The Infinite Dial 2019,” Edison Research (2019): <a href="https://www.edisonresearch.com/infinite-dial-2019/">https://www.edisonresearch.com/infinite-dial-2019/</a>. <a href="#fnref:1" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:2"> <p>PodNews. “The Total Number of Available Podcasts Is Now 700,000,” PodNews (2019): <a href="https://podnews.net/update/700000">https://podnews.net/update/700000</a>. <a href="#fnref:2" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:3"> <p>Hansen, S. “The Feed is the Thing: How RSS Defined PodcastRE and Why Podcasts May Need to Move On.” In J.W. Morris and E. Hoyt (eds), <em>Saving New Sounds: Dispatches from the PodcastRE Project</em> , University of Michigan Press, Ann Arbor (forthcoming 2020). <a href="#fnref:3" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:4"> <p>Jockers, M. L. <em>Macroanalysis: Digital methods and literary history</em> . University of Illinois Press, Champaign (2013). <a href="#fnref:4" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:5"> <p>Underwood, T. <em>Distant Horizons: Digital Evidence and Literary Change</em> . University of Chicago Press, Chicago (2019). <a href="#fnref:5" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:6"> <p>Clement, T. E. “Towards a Rationale of Audio-Text” <em>Digital Humanities Quarterly</em> , 10.2 (2016). <a href="#fnref:6" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:7"> <p>Clement, T. E. “When Texts of Study Are Audio Files: Digital Tools for Sound Studies in DH.” In S. Schreibman, R. Siemens, and J. Unsworth (eds), <em>A New Companion to Digital Humanities</em> , Chichester ; Malden, MA: John Wiley & Sons, Ltd., Chichester (2016): 348-57. <a href="#fnref:7" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:8"> <p>RSS Advisory Board. “RSS History.” RSS Board (n.d.): <a href="http://www.rssboard.org/rss-history">http://www.rssboard.org/rss-history</a>, accessed February 23, 2019. <a href="#fnref:8" class="footnote-backref" role="doc-backlink">↩︎</a> <a href="#fnref1:8" class="footnote-backref" role="doc-backlink">↩︎</a> <a href="#fnref2:8" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:9"> <p>Hines, M. “Netscape Broadens Portal Content Strategy,” Newsbytes. (1999): <a href="http://link.galegroup.com/apps/doc/A54120248/ITOF?u=umuser&sid=ITOF&xid=377f45">http://link.galegroup.com/apps/doc/A54120248/ITOF?u=umuser&sid=ITOF&xid=377f45</a>. <a href="#fnref:9" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:10"> <p>Winer, D. “Payloads for RSS.” (2001): <a href="https://web.archive.org/web/20080214205403/http://www.thetwowayweb.com/payloadsforrss">https://web.archive.org/web/20080214205403/http://www.thetwowayweb.com/payloadsforrss</a>, accessed February 23, 2019. <a href="#fnref:10" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:11"> <p>Bray, T., Hollander, D., Layman, A., Tobin, R., & Thompson, H. S. “Namespaces in XML 1.0 (Third Edition),” W3 (2009): <a href="https://www.w3.org/TR/xml-names/">https://www.w3.org/TR/xml-names/</a>, accessed February 23, 2019. <a href="#fnref:11" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:12"> <p>Bergen, M. “Google Brings Podcasting to Play Music, Swinging at Apple’s Dominance,” Recode (2015). <a href="#fnref:12" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:13"> <p>Morris, J. W., Hansen, S., & Hoyt, E. “The PodcastRE Project: Curating and Preserving Podcasts (and Their Data)” <em>Journal of Radio & Audio Media</em> , 26.1 (2019). <a href="#fnref:13" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:14"> <p>Adams, D. “After ‘Serial,’ Sponsors Pour Money into Podcasts,” <em>The Boston Globe</em> (2015): <a href="https://www.bostonglobe.com/business/2015/02/13/after-serial-sponsors-pour-money-into-podcasts/OKAzhUWtqCHQbl3IuEIiBN/story.html">https://www.bostonglobe.com/business/2015/02/13/after-serial-sponsors-pour-money-into-podcasts/OKAzhUWtqCHQbl3IuEIiBN/story.html</a>, accessed November 26, 2019. <a href="#fnref:14" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:15"> <p>Greene, M. and Meissner, D. “More Product, Less Process: Revamping Traditional Archival Processing,” <em>The American Archivist</em> , 68.2 (2005): 208–63. <a href="#fnref:15" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:16"> <p>Wang, J. H. “The Perils of Ladycasting: Podcasting, Gender, and Alternative Production Cultures.” In J.W. Morris and E. Hoyt (eds), <em>Saving New Sounds: Dispatches from the PodcastRE Project</em> , University of Michigan Press, Ann Arbor (forthcoming 2020). <a href="#fnref:16" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:17"> <p>Florini, S. “This Week in Blackness, the George Zimmerman acquittal, and the production of a networked collective identity.” <em>New Media & Society</em> 19.3 (2017): 439-454. <a href="#fnref:17" class="footnote-backref" role="doc-backlink">↩︎</a> <a href="#fnref1:17" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:18"> <p>Podcast Insights. “2020 Podcast Stats & Facts (New Research From Apr 2020),” Podcast Insights: <a href="https://www.podcastinsights.com/podcast-statistics/">https://www.podcastinsights.com/podcast-statistics/</a> <a href="#fnref:18" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:19"> <p>Winer, D. “RSS 2.0 Specification,” RSS 2.0 at Harvard Law (2015): <a href="https://cyber.harvard.edu/rss/rss.html">https://cyber.harvard.edu/rss/rss.html</a>, accessed February 23, 2019. <a href="#fnref:19" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:20"> <p>Hardesty, J. “Bias and Inclusivity in Metadata: Awareness and Approaches” . Indiana University Digital Collection Services. <a href="#fnref:20" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:21"> <p>Hogan, M. “Dykes on Mykes: Podcasting and the Activist Archive.” <em>TOPIA: Canadian Journal of Cultural Studies</em> 20 (2008): 199-215. <a href="#fnref:21" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:22"> <p>Crowe, Anne. “101 Quick & Actionable SEO Tips That Are HUGE.” <em>Search Engine Journal.</em> October 21, 2017. <a href="https://www.searchenginejournal.com/101-quick-seo-tips/180563/">https://www.searchenginejournal.com/101-quick-seo-tips/180563/</a>. <a href="#fnref:22" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:23"> <p>. “10 SEO Tips For Your Podcast.” Podcast Motor. September 22, 2015. <a href="https://www.podcastmotor.com/seo-tips-podcast/">https://www.podcastmotor.com/seo-tips-podcast/</a>. <a href="#fnref:23" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:24"> <p>Hoyt, E., Hughes, K., and Acland, C.R. “A Guide to the Arclight Guidebook.” In C.R. Acland and E. Hoyt (eds), <em>The Arclight Guidebook to Media History and the Digital Humanities</em> , REFRAME/Project, Falmer (2016): pp. 1-29. <a href="#fnref:24" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:25"> <p>It should be noted that the ability to obtain duration data is not currently available to front-end users of the PodcastRE site. <a href="#fnref:25" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:26"> <p>Barbaro, M. “Get Ready for The Daily, Your Audio News Report,” <em>The New York Times</em> (2017): <a href="https://www.nytimes.com/2017/01/30/podcasts/the-daily-get-ready-for-the-daily-your-audio-news-report.html">https://www.nytimes.com/2017/01/30/podcasts/the-daily-get-ready-for-the-daily-your-audio-news-report.html</a>, accessed November 30, 2018. <a href="#fnref:26" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:27"> <p>Jerde, S. “How NYT’s The Daily Grew to 5 Million Monthly Listeners and Became a Breakout Star,” Ad Week (2018): <a href="https://www.adweek.com/digital/how-nyts-the-daily-grew-to-5-million-monthly-listeners-and-became-a-breakout-star/">https://www.adweek.com/digital/how-nyts-the-daily-grew-to-5-million-monthly-listeners-and-became-a-breakout-star/</a>, accessed November 30, 2018. <a href="#fnref:27" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:28"> <p>Pompeo, J. “‘We Didn’t Expect to Make Money’: How <em>The Daily</em> ’s Michael Barbaro Unexpectedly Became the Ira Glass of <em>The New York Times</em> ,” <em>Vanity Fair</em> (2018): <a href="https://www.vanityfair.com/news/2018/07/how-the-daily-michael-barbaro-became-the-ira-glass-of-new-york-times">https://www.vanityfair.com/news/2018/07/how-the-daily-michael-barbaro-became-the-ira-glass-of-new-york-times</a>, accessed November 30, 2018. <a href="#fnref:28" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:29"> <p>“Up First: The Essential Morning News Podcast From NPR,” <em>NPR</em> (2017): <a href="https://www.npr.org/about-npr/522211062/up-first-the-essential-morning-news-podcast-from-npr">https://www.npr.org/about-npr/522211062/up-first-the-essential-morning-news-podcast-from-npr</a>, accessed November 30, 2018. <a href="#fnref:29" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:30"> <p>Podtrac, “Podcast Industry Audience Rankings,” Podtrac (2018): <a href="http://analytics.podtrac.com/industry-rankings/">http://analytics.podtrac.com/industry-rankings/</a>, accessed November 30, 2018. <a href="#fnref:30" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:31"> <p>“Up First,” <em>NPR</em> (2018): <a href="https://www.npr.org/podcasts/510318/up-first">https://www.npr.org/podcasts/510318/up-first</a>, accessed November 30, 2018. <a href="#fnref:31" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:32"> <p>“The Daily,” <em>The New York Times</em> (2018): <a href="https://www.nytimes.com/column/the-daily">https://www.nytimes.com/column/the-daily</a>, accessed November 30, 2018. <a href="#fnref:32" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:33"> <p><em>Affirmation Nation</em> , <em>Analyze Phish</em> , <em>Andy Daly Podcast Pilot Project</em> , <em>The Apple Sisters</em> , <em>Bitch Sesh</em> , <em>Comedy Bang! Bang!</em> , <em>Eardrop</em> , <em>Earwolf Challenge</em> , <em>Fogelnest Files</em> , <em>Glitter in the Garbage</em> , <em>Hard Nation</em> , <em>Hello From The Magic Tavern</em> , <em>Hollywood Handbook</em> , <em>How Did This Get Made</em> , <em>Improv4Humans</em> , <em>Kevin Pollak’s Chat Show</em> , <em>Mike Detective</em> , <em>Never Not Funny</em> , <em>Off Book</em> , <em>Professor Blastoff</em> , <em>Rafflecast</em> , <em>Ronna and Beverly</em> , <em>Spontaneanation</em> , <em>Throwing Shade</em> , <em>Topics</em> , <em>Totally Laime</em> , <em>U Talkin’ U2 2 Me</em> , <em>Who Charted</em> , <em>With Special Guest Lauren Lapkus</em> , and <em>Womp It Up!</em> <a href="#fnref:33" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:34"> <p><em>Chompers</em> , <em>Crimetown</em> , <em>Every Little Thing</em> , <em>The Habitat</em> , <em>Heavyweight</em> , <em>Homecoming</em> , <em>Mogul</em> , <em>Mystery Show</em> , <em>The Nod</em> , <em>The Pitch</em> , <em>Reply All</em> , <em>Sampler</em> , <em>Sandra</em> , <em>Science Vs.</em> , <em>StartUp</em> , <em>Surprisingly Awesome</em> , <em>Twice Removed</em> , <em>Uncivil</em> , and <em>Undone</em> . <a href="#fnref:34" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:35"> <p>Earwolf, “About Earwolf,” Earwolf (2018): <a href="https://www.earwolf.com/about/">https://www.earwolf.com/about/</a>, accessed November 30, 2018. <a href="#fnref:35" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:36"> <p>Gimlet Media, “About,” Gimlet Media (2018): <a href="https://www.gimletmedia.com/about">https://www.gimletmedia.com/about</a>, accessed November 30, 2018. <a href="#fnref:36" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:37"> <p>Though most blogs on the subject recommend tying duration to whatever length your content demands, they also routinely recommend shorter average durations, with <em>We Edit Podcasts</em> , for instance, writing, “it is possible to become successful with a longer show, but in general, the 22 minute rule trumps all” <sup id="fnref:38"><a href="#fn:38" class="footnote-ref" role="doc-noteref">38</a></sup>. <a href="#fnref:37" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> <li id="fn:38"> <p>We Edit Podcasts, “What Is the Optimal Length for a Podcast?” We Edit Podcasts (2016): <a href="https://www.weeditpodcasts.com/what-is-the-optimal-length-for-a-podcast/n">https://www.weeditpodcasts.com/what-is-the-optimal-length-for-a-podcast/n</a>, accessed November 30, 2018. <a href="#fnref:38" class="footnote-backref" role="doc-backlink">↩︎</a></p> </li> </ol> </div> </div> </div> </article> </main> <footer> <nav aria-label="footer links"> <ul> <li><a class="highlight-focus" href="/dhqwords/tags/">Tags</a></li> <li><a class="highlight-focus" href="/dhqwords/about/">About</a></li> <li><a class="highlight-focus" href="/dhqwords/categories/">Keywords</a></li> </ul> </nav> <div class="icons"> <a class="license highlight-focus" rel="license" href="http://creativecommons.org/licenses/by/4.0/"> <img alt="Creative Commons Attribution No-Derivatives 4.0 International License" src="/dhqwords/img/logos/license.svg" width="120" height="42"/></a> </div> </footer> </body> </html>