International Journal of Communication 20(2026) Short Video Format as Ubiquitous Artifact
Università della Svizzera italiana, Switzerland
Short videos first emerged as a core feature of visual-centric platforms and have rapidly expanded across nearly all online spaces, challenging traditional audiovisual norms and reshaping how content is created, consumed, and engaged with. This paper argues that a cross-platform conceptualization is necessary to define how short videos and their features generate meaning, distinguish them from other audiovisual artifacts, facilitate comparison across studies, and support cumulative research. The infrastructure of short videos will continue evolving, shaped by platform objectives and user demands. The conceptual framework introduced in this study allows for tracking and comparing these changes across diverse environments, highlighting how short videos shape platform aesthetics and user practices through mobile-based technological standards, affordances, and the interplay of multiple communication modes. By doing so, this framework provides a solid theoretical foundation for developing tailored methodologies capable of grasping the complex components of this format.
Keywords: short videos, short video format, visual platforms, visual communication, social media
Giulia Isabella Guerra: [email protected]
Date submitted: 2025-06-20
Short videos significantly shape users’ experience in current platform ecosystems. Their design aligns with smartphone interfaces and intended use, offering immersive consumption (Mulier et al., 2021; Napoli, 2016; Ryan, 2018) and facilitating creative production (Bahiyah & Wang, 2020; Schellewald, 2021; Zeng & Abidin, 2021). This configuration is particularly relevant in a context of content abundance (Boczkowski, 2021) and growing content consumption via smartphones (Newman et al., 2024), as it contributes to maximizing instant engagement.
Short videos were initially designed for creating user-generated content in highly visual and mobile-first platforms like Snapchat, Instagram, and TikTok, showcasing self-representation, everyday practices, and entertainment (Bayer et al., 2016; Napoli, 2016; Neal & Ross, 2018; Schellewald, 2021; Vandersmissen et al., 2014). Over time, short videos have shaped the aesthetics and practices of mobile communication, with most platforms adopting this format, which is increasingly used by companies (Ge et al., 2021), news providers (Klug & Autenrieth, 2022), educators (Carpenter et al., 2024), politicians (Cervi et al., 2021; Zamora-Medina et al., 2023), professional digital creators, and influencers (Abidin, 2020).
Despite growing research on short videos, a comprehensive conceptualization remains lacking, and most studies explore these videos in relation to the design and practices typical of individual platforms like TikTok, missing a cross-platform perspective. However, conceptualization is central to theory building because it links abstract ideas to empirical cases, distinguishes the object of study from others, specifies its meaning and properties, and creates shared knowledge that enables comparison across studies conducted in different settings. In this way, it supports rigorous and cumulative research over time (McLeod & Pan, 2005), which is crucial in the current fragmented state of research on short videos.
This study aims to address this gap by conceptualizing short videos as a distinctive format of visual platforms, resulting from the relation among technical standards, user behavior, and situational context (Jancovic et al., 2019; Sterne, 2012; Volmar, 2017). Building on research grounded in visual communication and multimodality, this article outlines the structural characteristics that define this format, situating them within the growing prominence of visual modalities and mobile video technologies in social media. A conceptual framework that treats short videos as a distinctive format, rather than a platform feature, illustrates how their structure and the interplay of diverse modes of communication shape mobile aesthetics and user practices.
In the concluding section, I reflect on the methodological challenges of analyzing mobile audiovisual products and outline future research directions. In this regard, a format-based conceptualization supports tailored methodologies that are valid across platforms and are needed to examine in depth how the format’s elements interact, generating meaning for platforms and users.
Social Media as Visual Media
Social media are largely visual media, in fact becoming “image and video centered social media platforms” or “visual-centric social media platforms” (Muñoz & Towner, 2017, p. 291). As long-standing platforms have integrated more visual elements in their architectures, the visual became the dominant mode in these spaces (Rettberg, 2017) and “a critical concern for social media research” (Highfield & Leaver, 2016, p. 48). In recent years, user attention has shifted toward visual platforms that allow users to create and share primarily or exclusively images and videos, especially Instagram and TikTok (Kemp, 2025). The rapid advancement of digital and mobile technologies has indeed facilitated the production and sharing of a huge amount of visual data (Lobinger, 2017).
While some platforms were designed to be visual-first, including Snapchat, Instagram, and TikTok, others like X (formerly Twitter) incorporated visual features later in response to evolving user demands. Indeed, when Twitter was launched in 2006, it was a text-based microblogging platform allowing 140-characters posts. Only starting from June 2011, users could attach images directly to tweets (Rao, 2011). In contrast, Snapchat was introduced in July 2011 as a visual-first platform enabling users to share ephemeral photos and videos that vanished after being viewed (Bayer et al., 2016). This innovation influenced competitors: Instagram introduced video sharing in June 2013 and Stories in August 2016, which disappear after 24 hours, mimicking Snapchat; Twitter introduced direct video uploading in June 2015 (Hern, 2015). In 2021, YouTube introduced Shorts to compete in the growing market of short videos (Sherman, 2021). As this brief timeline highlights, visual modalities spread through platform interdependence, driven by data sharing, service integration, and user migration (Dijck, 2013). Indeed, “social media platforms constitute a form of environment” characterized by distinct rules and practices, where “no two platforms are alike, although many tend to use similar features and functionalities” (Bucher & Helmond, 2018, p. 243). In this regard, moving images proliferate online, often spreading beyond their native platform (e.g., clips from films or TV shows may circulate on social media and vice versa), resulting in formats that do not always adapt to the new hosting channel and may carry new meanings. Short videos constitute a distinctive format on visual platforms because of a combination of structural features involved in processes of production, consumption, and interaction that are inherent to the videos themselves rather than tied to the hosting platforms. These features distinguish short videos from similar media artifacts, thus serving as the foundation for conceptualizing this format (McLeod & Pan, 2005).
As visual platforms continue to rise in popularity, competitors are likely to adapt, reinforcing the shift toward more visually oriented forms of content production and consumption. Consequently, renewed theoretical frameworks and methodologies are essential to examine how visual modalities shape platforms’ architecture and user behavior.
Visual and Multimodal Interaction in Short Videos
Videos are sequences of moving images and can be understood as hybrid media objects (Carroll, 1996) that combine the power of visual with other modes. Movement is thus central and is recognized as a new marker of visual modality (Ravelli & Van Leeuwen, 2018). In multimodal media environments, the visual mode interacts with others, such as sound and language, shaping meaning making, structuring narrative processes, directing viewers’ attention, and influencing the perception of actions, subjects, and spatial relations (Bateman et al., 2017; Van Leeuwen, 2005; Wildfeuer, 2014).
With respect to the short video format, the visual mode is central in consumption and production, and it contributes to shaping platform aesthetics and associated practices. A combination of animations and design optimized for mobile viewing captures users’ attention, creates an immersive experience (Mulier et al., 2021), and shapes the aesthetic of mobile video making (Ryan, 2018). Visual resources help creators establish identity and structure content through recognizable aesthetic choices, such as color schemes and fonts, as demonstrated by Matwick and Matwick (2025) in their studies on food-related Instagram Reels. Similarly, filters assume different roles across platforms; on Instagram, they mediate identity, enable playful or artistic expression, help create social connections, and reproduce cultural norms (Miller, 2024). Instagram filters often encourage the production of content aligned with the polished aesthetics of the platform, while filters on Snapchat encourage rather spontaneous and playful content that people share with their close friends (Schreiber, 2017).
The short video format primarily relies on the visual mode, as it is the precondition for its production and consumption. Other modes, such as sound or text, may be optional, as videos can be created and viewed without audio, captions, or subtitles. Moreover, the interfaces of major short video apps (see Figure 1) present a range of information (e.g., creator, caption, soundtrack, geotags) and the actions users can take (e.g., interacting with the video or recreating it with the same music template or filter), making the experience intuitive and immediate. The visual is indeed a mode of communication based on association, producing sensory immediacy because visuals are perceived holistically and quickly, and their meaning is strongly dependent on context and prior visual experiences (Müller, 2007). For this reason, showing icons and graphics that present the main format-specific affordances, including affordances of customization and interaction, can further encourage users to engage, turning visual perception into participation.
Figure 1. Shared interface affordances of the short video format across four social media platforms (as of December 2025).
Note. Author’s own screenshots and illustration; publicly accessible profiles. From left to right: screenshot of Instagram Reels (Tedone, 2025), TikTok (Khabane, 2025), YouTube Shorts (Zibayi, 2025), and Snapchat Spotlight (Levanti, 2025).
On short videos, the interaction of the visual mode with others such as sound, lyrics, and gesturing supports creative production, imitation, and social practices (Cervi & Divon, 2023; Han & Zappavigna, 2024; Rettberg, 2017; Wang, 2021), enhancing sensorial experience (Matwick & Matwick, 2025). Given the combination of visual and auditory stimuli, which support motivation and perceptual skills, short videos leverage high levels of sensory stimulation (Nicolaou et al., 2019). Thus, meaning making emerges from the multilayered combination of multimodal resources (Grzenkowicz & Wildfeuer, 2025).
Content, Form, and Format: What Is a Short Video?
When opening a social media app on their smartphones, users are highly likely to encounter short videos in their feed. With their growing use (Newman et al., 2024) and adoption by the leading platforms, they have become ubiquitous.
Although several valuable contributions address the topic, a shared conceptualization of short videos is lacking, and terminology varies significantly across studies. Indeed, some studies use the term “video” regardless of platform, whether referring to Vine (Zhang et al., 2014), Snapchat (Schreiber, 2017), Instagram (Muñoz & Towner, 2017), or TikTok (Cervi & Divon, 2023; Rettberg, 2017) or in cross-platform studies (Nielsen & Ganter, 2022). Others adopt “short-form video” for Vine (Vandersmissen et al., 2014) and “short video” for Snapchat (Bayer et al., 2016) and TikTok (Abidin, 2020; Ge et al., 2021; Yaqi et al., 2021; Zeng & Abidin, 2021; Zeng & Kaye, 2022). Varied terminology can also be found within the same study (Anderson, 2020; Lee, 2023). Research on TikTok shows the most diverse terminology, including, for example, “video content” (Klug & Autenrieth, 2022), “short-video content” (Schellewald, 2021; Shutsko, 2020), and “user-generated short-form videos (UGSVs)” (Wang, 2021). Previous research emphasizes the brevity of these videos as the primary feature, but uses the terms “content,” “form,” and “format” interchangeably, despite relevant conceptual differences. Some studies consider TikTok videos as a format, yet they identify affordances and features based on a single-platform perspective (Cervi et al., 2021; Chen et al., 2021; Kaye et al., 2022), which hinders identifying which elements make these videos a distinctive format.
Given the ubiquity of short videos and the need for dedicated approaches for analyzing them, a cross-platform conceptualization is required. Based on McLeod and Pan’s (2005) process of concept explication, which entails (a) analyzing the meaning of a concept, (b) describing its essential qualities, (c) identifying its core dimensions, and (d) prescribing how these dimensions relate to the real world, I first address the definition of short videos as a format and discuss its meaning in social media culture, then identify its structural characteristics and associated user practices, linking them to the platforms in which they operate (p. 25).
Content
In media and communication research, “content” is broadly used as a container term indicating not only the information or meaning of a message but also its form, such as image, text, and sound (McQuail & Deuze, 2020). Within social media scholarship, “content” typically refers to the wide range of materials produced and shared across platforms, including posts, comments, images, videos, and music, along with their associated meanings (Bucher & Helmond, 2018; Dijck, 2013; Gerlitz & Helmond, 2013; Papacharissi, 2009). These media artifacts are not confined to a single platform, but are part of a broader media ecosystem where users actively participate in their creation, distribution, reshaping, and consumption (Jenkins, 2006). This active role of users gives rise to what is commonly referred to as user-generated content (UGC), “the sum of all ways in which people make use of Social Media” (Kaplan & Haenlein, 2010, p. 61), encompassing various materials like “tweets, status updates, blog posts and comments, photographs and videos and so on” (Lupton, 2015, p. 3). Adopting this perspective, short videos are undoubtedly UGC (Shutsko, 2020), but they are also audiovisual media artifacts (Klug, 2020) that structure complex multimodal messages (Rettberg, 2017). With regard to the nature of visual media artifacts, Lobinger (2016) suggests distinguishing between the symbolic dimension of content, referring to the meanings acquired in practices of production, consumption, and exchange, and the material dimension of visual objects, “situated within particular temporal and spatial contexts” (p. 478). Extending this perspective to the other nonvisual and underlying components of the format, we may argue that its complex multimodal nature is something the term “content” alone fails to fully capture.
Form
The term “form” is typically used to describe the visible configuration of platforms and their artifacts, for example, the structure of early social network sites, such as “the form of profile-centric sites” (boyd & Ellison, 2007, p. 216), or blogs as “the earliest form of Social Media” (Kaplan & Haenlein, 2010, p. 63). Wildfeuer (2014) argues that the individual modalities of films should be described “on several levels to reveal their form and meaning” (p. 32), separating the two concepts. Similarly, Aiello and Parry (2019) distinguish between the form and content of images:
Images are produced via various means (drawing, photography, animation, digital effects) and so their “content” should be considered alongside their form and medium. The choices made regarding representational form and content work together to produce meanings through visual means (in interplay with other modes of communication). (p. 17)
Therefore, defining short videos by their short form highlights their limited duration, yet a comprehensive understanding of their transformative role on visual platforms must also account for underlying elements that are not immediately observable.
Format
While some studies already recognize short videos as a format (Canella, 2018; Cervi et al., 2021; Kaye et al., 2022; Mulier et al., 2021; Napoli, 2016), a consistent conceptualization remains elusive. Format theory emphasizes that the creation, distribution, and consumption of media are deeply influenced by formats: “Format denotes a whole range of decisions that affect the look, feel, experience, and workings of a medium” and “a set of rules according to which a technology can operate” (Sterne, 2012, p. 7). Formats enable information transmission across infrastructures since they “consist of specific sets of descriptions and requirements of how to arrange and present information” and “ensure interoperability across diverse industries and ecologies of media devices” (Jancovic et al., 2019, p. 7). Formats define not only media file type, for example, MP3 for audio and MP4 for videos, but also their underlying structure, aesthetics, the ability to function across different systems, and the intended ways of using them. While platforms provide affordances that enable and constrain user activities (Bucher & Helmond, 2018), formats also embody specific affordances that mediate interactions among users and platforms (Volmar, 2017), shaping communicative and sociocultural practices. For this reason, they are considered “critical nodes of media culture” (Jancovic et al., 2019, p. 10). For example, Dijck (2013) describes the tweet as a textual “global format” whose structural characteristics, particularly the 140-character limit and the use of hashtags, influenced the “online culture” and journalism practices (p. 76). Changes in media formats reshape usage patterns. For instance, the rise of mobile video technology and vertical framing has challenged traditional filmmaking norms, encouraging amateur content creation (Napoli, 2016; Ryan, 2018), while posing challenges to professional video makers who need to integrate this format into conventional media like local television (Canella, 2018). Thus, adopting a format often reflects strategic choices beyond aesthetics, driven by technological, economic, or functional needs (Jancovic et al., 2019).
Given this, “format” comprehensively encompasses the representation and underlying structure of a media artifact, or form, its associated information, or content, and the cultural practices embodied in its usage. Thus, “short video format” best indicates the complex nature of this media artifact.
Structural Characteristics of the Short Video Format
My conceptualization of short videos as a format is grounded on the relation among operational standards, affordances, intended uses, and situational context, as proposed by Sterne (2012). Indeed, Sterne indicates a comprehensive approach to analyzing formats that considers technical specifications (e.g., infrastructures, codes, and operational standards), political and institutional forces involved in formats’ standardization and distribution, user behavior and perception, and the historical legacy of prior technologies and practices that drove formats’ development. Similarly, Entrena-Serrano (2025) argues that users’ ability to curate the content they consume on social media (“consumptive curation affordances”) is co-shaped by three interlinked forces: technological design of platforms, user practices, and broader social arrangements (economic, cultural, and regulatory contexts).
The short video format is thus shaped by the architecture of platforms, that is, their underlying technical structure and design, which evolve constantly (boyd & Ellison, 2007; Bucher & Helmond, 2018; Couldry & Hepp, 2018; Dijck, 2013; Papacharissi, 2009), making it challenging to define stable characteristics across time and platforms. Nevertheless, the analysis of existing literature reveals nine structural characteristics that constitute the basis of the conceptual framework for the short video format: the native spaces in which short videos originate and spread, their technical infrastructure, their brief length and vertical display aligned with smartphone interfaces, creative customization functions and auditory affordances, interactive features and cross-platform transitions that enhance user engagement, and versatility in conveying diverse communicative genres. These characteristics are discussed in detail below.
Native Visual Spaces and Infrastructure
The native spaces of short videos are “short video platforms” (Schellewald, 2021; Yang et al., 2019; Yaqi et al., 2021; Zeng & Abidin, 2021; Zeng & Kaye, 2022) or “video-sharing social networking service” (Lee, 2023, p. 408), which are designed to facilitate fast mobile consumption of audiovisual content. The architecture of these platforms, primarily conceived for mobile consumption of images and videos, has influenced the technological standards of the short video format, which has integrated new functionalities while migrating across applications. Vine and Snapchat pioneered short videos in 2013. Vine allowed six-second looping clips in a square (1:1) format (Vandersmissen et al., 2014; Zhang et al., 2014), which did not fully suit smartphones’ vertical displays. Vertical videos, in contrast, were introduced by Snapchat in My Story and were designed to disappear after 24 hours, popularizing ephemeral content (Bayer et al., 2016). Instagram followed with 15-second Stories in 2016, while YouTube enabled uploads of vertical videos in 2015 (Neal & Ross, 2018). TikTok, launched in 2017, redefined this format, making vertical short videos permanent and central to the platform. Its popularity, especially among young audiences (Yang et al., 2019), influenced other competitors; therefore, Instagram launched Reels in 2020, and YouTube introduced Shorts in 2021. Google began testing a Short Videos tab in its search menu in April 2024 (Schwartz, 2024), making it permanent in 2025, while Netflix is testing a mobile-only vertical feed for short videos in its iOS and Android apps (Forristal, 2025), highlighting the format’s growing relevance.
The technical infrastructure of short videos is similar across visual platforms. To watch short videos, users need a stable Internet connection with at least 1 Mbps of bandwidth, which most mobile networks provide (G. Zhang et al., 2021). Insufficient bandwidth affects seamless video streaming, which may lag or fail to load properly. Content creation relies on mobile software development kits and application programming interfaces, enabling video recording and editing with filters and music. After a video is created, it is compressed on the user’s device using standard video formats like H.264 or H.265, uploaded to the platform, and processed. The video is transcoded into various resolutions (e.g., 480p, 720p, and 1080p) using codecs such as H.264. Then a preview image or thumbnail is generated, and metadata such as file type, resolution, and geotags are extracted (Kim et al., 2021). Videos are stored using object storage services and delivered through content delivery networks, which quickly send videos from servers close to the viewer’s geographical location. A complex architecture of microservices manages feeds, notifications, analytics, and more (Sinha, 2024). To ensure smooth playback, streaming relies on adaptive bitrate streaming, which modifies video quality depending on the user’s Internet connection (Kaplunovich & Kaplunovich, 2023). Finally, security mechanisms moderate content (Zeng & Kaye, 2022), while recommendation systems personalize platforms’ feeds using machine learning models that aim to predict what users prefer to see based on their behavior (Zulli & Zulli, 2022).
The infrastructure of short videos is expected to continue evolving, shaped by platform objectives as well as user practices and demands. A clear conceptualization allows for tracking these changes across diverse settings and building cumulative research (McLeod & Pan, 2005).
Length
The short video format originated within platforms like TikTok, designed for “short-lived consumption” (Schellewald, 2021, p. 1437). “Short” refers to both video length and consumption habits: Short videos typically appear in scrollable feeds, such as Instagram Stories and YouTube Shorts, where users watch multiple videos in a short time. Often, these videos automatically play without requiring users to click or scroll, a strategy pioneered by Vine and adopted by competitors (Anderson, 2020). This seamless autoplay fosters “batch view” (Zhang et al., 2014, p. 87), allowing users to watch multiple clips effortlessly, making skipping difficult and keeping engagement high. TikTok adopts a similar mechanism: “watch, scroll, repeat” (Entrena-Serrano, 2025, p. 10). Scrolling creates a continuous content flow that can be both engaging and exhausting, as users may scroll compulsively, driven by fear of missing out (Lupinacci, 2021), yet scrolling can reinforce serendipitous discovery (Kiddle et al., 2023), counterbalancing algorithmic recommendations that potentially create filter bubbles and echo chambers. Conversely, serendipitous discovery enables users to find unexpected, meaningful content that may shift perspectives or spark new interests (Reviglio, 2019). In this regard, content flow on Douyin has been found to enhance serendipitous discovery by offering vivid, engaging short videos combined with cognitive lock-in (Yang et al., 2023), while TikTok leverages recommendation algorithms that preclude direct user control over content selection, offering surprising content and keeping users engaged over time (Zhao & Wagner, 2023).
Content flow is embodied in video length since videos under three minutes attract higher engagement (Guo et al., 2014), reflecting users’ decreasing attention (Rajendran et al., 2024). Short videos’ length varies across platforms and over time; as of 2025, TikTok allows recording clips from 15 seconds to 10 minutes (TikTok Support, 2025); the maximum length of Instagram Reels was extended to three minutes, doubling the previous limit of 90 seconds (Instagram Help Center, 2025); and YouTube Shorts expanded the length from 60 seconds to three minutes (YouTube Help, 2025). Videos up to 60 seconds typically perform better; on YouTube, for instance, Shorts attract more views and likes per view than longer videos (Violot et al., 2024), while on Kuaishou, one of the largest short video platforms in China, short videos get fewer clicks per view than long videos, but they often receive more likes and shares, probably because of algorithmic recommendations (Chen et al., 2024). Despite longer video limits, TikTok for Business (2021) recommends that marketers produce in-feed advertisements of 21–34 seconds. Supporting this, Salminen et al. (2024) found that 82.5% of users stop watching TikTok ads within the first quarter of a video averaging 56.7 seconds. These findings align with TV advertising research. Singh and Cole (1993) demonstrated that 15-second ads are most cost-effective for informational content, whereas 30-second ads better suit emotional messaging to enhance brand recall and attitudes.
Videos lasting several minutes demand greater attention from users, which conflicts with fast-paced scrolling and compulsive consumption. Nevertheless, creators usually adapt video lengths to their needs, with many exceeding one minute for storytelling or in-depth reviews. Hence, length alone does not define the short video format, but it shapes how platforms control information flow, content consumption, and user engagement.
Interactive Features and Cross-Platform Transitions
Short videos are social and cultural products, as their affordances shape interactions and content sharing within and beyond native platforms (Nielsen & Ganter, 2022; Schellewald, 2021; Wang, 2021). These format-specific affordances combined with users’ agency and “popularisation of behaviours” (Highfield & Leaver, 2016, p. 51), structure and stabilize cooperative practices between users and platforms. This is an essential function of media formats, which create shared infrastructures in which platform-specific content, aesthetics, and symbols circulate and become established (Volmar, 2017).
Functions enabling praising, commenting, and sharing increase sociability, the inclination of users to connect with others (Ge et al., 2021). Social interactions are a major factor driving users to engage with apps such as TikTok, alongside activities such as archiving, which involves saving and curating videos; self-expression, which entails sharing personal content; escapism, understood as seeking relaxation or distraction from daily life; and peeking, which refers to observing the lives of others (Bahiyah & Wang, 2020). Apps like TikTok help users connect easily with others, yet technical factors like personalization and entertainment, together with social factors such as interaction anxiety and isolation, can increase app dependence and attachment (Zhang et al., 2019). Users typically share videos by clicking the “share” icon, a right-pointing arrow located in the bottom-right corner of most platforms (see Figure 1). TikTok users, for example, can share a video in app or through external channels, including e-mail or platforms like WhatsApp and Facebook (Zulli & Zulli, 2022). The standardized MP4 video format facilitates content sharing across platforms, enabling compatibility and smooth playback.
Ultimately, interacting features foster a do ut des exchange between creators, who produce content to attract audience’s attention and maintain visibility, and followers, who spend time consuming this content and interacting through likes, comments, and shares, gaining free access to a large volume of content that can be easily consumed at any time.
Vertical Display
Most apps adopted vertical displays for video content, reflecting how users access social media via mobile devices such as smartphones. This challenged the traditional horizontal norms of film and television, creating a new aesthetic paradigm aligned with users’ habits of watching and recording videos on their smartphones (Napoli, 2016; Neal & Ross, 2018; Ryan, 2018). Film and television traditionally employed horizontal formats (4:3 and 16:9, respectively) designed for large screens, such as cinemas and TVs, maximizing peripheral vision and immersive experiences for wide scenes and landscapes. In contrast, vertical video (9:16) was relegated to amateur genres (Mulier et al., 2021; Napoli, 2016), yet it has now become standard for mobile video creation also in professional contexts (Canella, 2018).
The viewing experience in vertical format is perceived as more intimate, immersive, and immediate (Mulier et al., 2021; Ryan, 2018), aligning with the personal nature of smartphones and their usage. It constitutes a form of “small-screen image making, where expansive horizons lose detail and clarity, and the intimate and the personal rule” (Ryan, 2018, p. 257). In terms of shooting, vertical video emphasizes close-up and portrait-style framing rather than wide, expansive shots typical of horizontal orientations. This approach impacts composition, often centering the subject to enhance viewer engagement and highlight individual moments rather than panoramic scenes (Napoli, 2016; Ryan, 2018). While this aesthetic resonates strongly with social media contexts, it draws on a longer history in which the “landscape” and “portrait” formats have always “lived together” (Napoli, 2016, p. 47).
The short video format adopts a vertical display to align with smartphone-based technologies; indeed, “users make vertical video because it makes sense within the technology” (Ryan, 2018, p. 258). This relates to format theory, which defines a format as “a set of rules according to which a technology operates” (Sterne, 2012, p. 7). Similarly, the aesthetics and practices associated with short videos reflect the ability of formats “to stabilize practices and forge collectives” (Jancovic et al., 2019, p. 11).
The short video format is thus an audiovisual media object composed of a set of moving images, typically adopting a 9:16 aspect ratio, the default vertical display designed for optimal viewing on smartphones. The shift from horizontal to vertical format challenges traditional shooting and compositional norms, introducing a new aesthetic aligned with the intimate, personal experience of consuming mobile media and fostering a distinctive visual language in video production and consumption.
Creative Customization Functions
The short video format features creative customization functions directly provided in app that combine different modes of communication, thereby facilitating content creation, boosting creativity (Shutsko, 2020), and shaping sociocultural practices (Cervi & Divon, 2023; Zeng & Abidin, 2021). Editing functions, which traditionally existed in separate programs and applications, are now directly integrated in app, offering more accessible customization capabilities (Gerlitz et al., 2019; Wang & Gu, 2022). This allows users to control and modify modalities on their content, an ability formerly reserved for experts (Ravelli & Van Leeuwen, 2018).
Editing typically occurs at two stages in most visual platforms: production and postproduction (see Figure 2). During the production stage, users edit their content while recording the video. For instance, they can apply effects to adjust lighting, set a maximum clip duration, control speed, add a green screen or background audio, and more. After generating the clip, users can preview it before sharing. In this preview window, they can make additional changes, such as cutting scenes, adding subtitles or music, and much more. Creators can also customize content during posting by adding textual elements like captions, geotags, and hashtags that are often used to increase visibility through algorithms. Thus, creative customization functions contribute to making platforms like TikTok dynamic spaces, transforming the app’s use into a “playful” and “gamified” activity (Schellewald, 2021, p. 1448). However, these editing possibilities are not unlimited, but rather aligned with each platform’s aesthetics and objectives, thereby shaping sociocultural practices and associated meanings (Schreiber, 2017). In this regard, since TikTok follows an imitative logic, users can replicate similar videos using standardized and reusable auditory and visual effects, communicating values and expressing identities (Zeng & Abidin, 2021). Users are thus encouraged to interact with each other by adopting the same visual languages and styles.
Figure 2. Author’s screenshots illustrating the editing functions available in the TikTok mobile application during the production and postproduction stages (as of November 2025).
Note. Screenshots were taken for illustrative purposes only.
Auditory Affordances
Sound is a semiotic mode that contributes to meaning making in multimodal artifacts, using resources such as rhythm, volume, pauses, and timbre to convey emotions, actions, or structure narratives. Sound often interacts with other modes, such as visual and textual, guiding audience attention, emotions, and interpretations (Van Leeuwen, 1999, 2005). In multimodal artifacts, music and visuals have a long history of connections and are increasingly prevalent in contemporary media, such as music videos (Bateman et al., 2017). Short videos fit within this context, as they display distinctive audiovisual features that combine synchronized music, gestures, movements, and camera effects into performative expressions (Klug, 2020).
Video-based platforms typically enable users to record or upload clips with original audio, allowing for spoken content with natural background sounds. Alternatively, users can replace the original audio with other sounds available on the app, such as music tracks or popular songs, as illustrated in Figure 2. On Instagram Reels, the interaction of different sound modes such as music, background sounds, and voiceover constructs an immersive and performative narrative, conferring dramatic effects and transforming videos into hyper-sensorial experiences (Matwick & Matwick, 2025). On platforms like TikTok, sound effects support self-presentation and artistic performance, driving popular forms of entertainment such as lip syncing or dancing that often acquire a social meaning for user communities (Klug, 2020; Rettberg, 2017). Reusing and remixing background audio is central to viral trends, functioning as audio memes that enhance imitation and engagement (Zeng & Abidin, 2021). Audio memes enable TikTok creators to shape narratives, convey emotion, time punchlines, and organize trends, often paired with editing techniques to amplify virality. They encompass diverse remix styles and ownership variations, serving as “an organizing principle for how content is catalogued into repositories on TikTok, and how users navigate the platform to seek new trends and contents” (Abidin, 2020, p. 80).
Furthermore, most social media apps enable users to add subtitles to videos in postproduction. This function underscores the role of visuals in enhancing comprehension and conveying meaning in video technologies, allowing people to watch and understand content in contexts where audio is either impractical (e.g., noisy environments) or inappropriate (e.g., medical waiting rooms). Subtitles ensure inclusivity for viewers with hearing impairments and can be customized for users with visual impairments. Additionally, they support non-native speakers, promote social integration, and help increase viewer engagement and the overall experience (Agulló et al., 2019). W. Zhang et al. (2021) found that subtitles in TikTok videos of public hospitals in China enabled viewers to immediately grasp the main ideas and facilitated understanding of the health information provided. The combined use of audio and subtitles makes short videos accessible and comprehensible to a wide audience, two essential requirements for media formats (Jancovic et al., 2019). In the short video format, auditory affordances thus support creative production, user engagement, and content fruition.
Versatility of Communicative Genres
Short videos emerged as a more engaging alternative to traditional video formats, reflecting the tastes of young audiences, fitting fragmented viewing habits, and fostering grassroots creativity (Yang et al., 2019). By integrating multiple modes of communication and enabling editing and personalization, the short video format becomes versatile, conveying diverse communicative genres and fulfilling various social functions (Van Leeuwen, 2005).
Entertainment is the most prominent communicative genre, yet its centrality varies across platforms. For instance, entertainment-related categories dominate YouTube Shorts, while educational, political, and artistic content is typical of longer YouTube videos (Violot et al., 2024). Conversely, TikTok primarily offers entertainment, artistic performances, and self-expression, aligning with its younger audiences (Schellewald, 2021; Shutsko, 2020; Zeng & Abidin, 2021). Shutsko (2020) showed that comedy, musical performances, DIY tutorials, and dance constitute the majority of UGC on TikTok, while Schellewald (2021) identified six overarching communicative forms—comedic, documentary, communal, interactive, explanatory, and meta—emphasizing that typical short video content filters reality through expressive forms of comedic entertainment, addressing topics like religion, politics, global events, and individual life circumstances in a lighter yet no less meaningful way. Moreover, content that shares knowledge, including tutorials and life hacks, enriches the social experience of TikTok, along with popular audio trends and challenges that encourage playful engagement and skills showcasing among users. Entertaining short videos are therefore not merely “silly content”; they give rise to complex forms of communication (Schellewald, 2021), including self-expression, talent showcasing, and user engagement through shared interests, reflecting a performative social media experience (Shutsko, 2020) that is also community-building (Zeng & Abidin, 2021).
Short videos also support professional content. Creators present scientific information (Zeng et al., 2021), journalistic news (Klug & Autenrieth, 2022), and educational material (Carpenter et al., 2024), often using entertaining features like duets or stitches, enhancing engagement. Politicians use short videos to gain visibility among younger audiences (Cervi et al., 2021; Zamora-Medina et al., 2023), as do digital activists who exploit playful affordances to promote their causes (Cervi & Divon, 2023). Additionally, short video advertisements offer vivid and low-cost promotion for sellers (Ge et al., 2021).
Short video content must comply with platforms’ rules and national regulatory frameworks. Sensitive or disturbing topics are prohibited and banned by moderators, whose work is increasingly important, as “moderating video content is more technologically demanding and time-consuming” (Zeng & Kaye, 2022, p. 80). Platforms typically avoid controversial or inappropriate content, though Shutsko (2020) observed frequent cases of potential violation of copyright and privacy on TikTok in Germany, according to the national law. Despite content moderation challenges, the short video format has become central for conveying diverse communicative genres on visual platforms, especially among younger audiences, sharing knowledge and supporting varied forms of expression.
Conclusions and Reflections for Future Research
This study conceptualizes the short video format and outlines its structural characteristics in a cross-platform perspective, addressing how their technological underpinnings and the interplay of different modes of communication shape meaning for users. I argue that short videos should be understood as a format, not merely as “content,” which refers to the information and meanings conveyed, nor solely as a “form” of video, which describes their visible configuration and structure.
The short video format is an audiovisual artifact composed of moving images arranged sequentially, produced and consumed primarily on visual platforms. It adopts a technical infrastructure, operative standards, a full-screen vertical display, and a limited length designed to enhance rapid and immersive consumption via mobile devices. Through creative customization functions and the interplay of visual and auditory affordances, this format shapes platforms’ aesthetics and supports varying content creation, conveying diverse communicative genres. The interactive features and cross-platform transitions shape user behavior and engagement, enhancing sociability, playful experiences, and collective meaning making, both inside and outside native platforms. The rise of this format challenges traditional audiovisual norms while facilitating new user practices that redefine social media communication.
The growing relevance of short videos has fostered an emerging body of research aimed at developing and readapting methods of analysis (e.g., Cervi & Divon, 2023; Grzenkowicz & Wildfeuer, 2025; Wang, 2021). It must be emphasized that analyzing moving images is challenging, as it requires adapting existing methods to the format’s multimodal nature and its mobile-based technologies. Understanding how short videos mediate messages and produce meaning also demands consideration of the sociocultural context and the platform-specific practices through which users engage with them (Kaye et al., 2022). The conceptual framework introduced in this work is situated within this context: By identifying the distinctive characteristics of the format across platforms, it allows studies approaching the concept from different perspectives or research settings to engage in dialogue and be compared, thereby reducing complexity and supporting cumulative, rigorous research (McLeod & Pan, 2005). A format-based conceptualization thus facilitates the development of tailored methodologies that extend beyond individual platforms, allowing a comprehensive analysis of the format’s elements (Sterne, 2012). This approach further supports a deeper understanding of how visual culture evolves within social media environments (Reißmann et al., 2023), emphasizing the interplay of technological standards, audiovisual characteristics, and affordances in generating meaning for users.
References
Abidin, C. (2020). Mapping Internet celebrity on TikTok: Exploring attention economies and visibility labours. Cultural Science Journal, 12(1), 77–103. https://doi.org/10.5334/csci.140
Agulló, B., Montagud, M., & Fraile, I. (2019). Making interaction with virtual reality accessible: Rendering and guiding methods for subtitles. Artificial Intelligence for Engineering Design, Analysis and Manufacturing, 33(4), 416–428. https://doi.org/10.1017/S0890060419000362
Aiello, G., & Parry, K. (2019). Visual communication: Understanding images in media culture. Thousand Oaks, CA: SAGE Publications.
Anderson, K. E. (2020). Getting acquainted with social networks and apps: It is time to talk about TikTok. Library Hi Tech News, 37(4), 7–12. https://doi.org/10.1108/LHTN-01-2020-0001
Bahiyah, O., & Wang, D. (2020). Watch, share or create: The influence of personality traits and user motivation on TikTok mobile video usage. International Journal of Interactive Mobile Technologies (iJIM), 14(4), 121–137. https://doi.org/10.3991/ijim.v14i04.12429
Bateman, J. A., Wildfeuer, J., & Hiippala, T. (2017). Multimodality: Foundations, research and analysis—A problem-oriented introduction. Berlin, Germany: De Gruyter.
Bayer, J. B., Ellison, N. B., Schoenebeck, S. Y., & Falk, E. B. (2016). Sharing the small moments: Ephemeral social interaction on Snapchat. Information, Communication & Society, 19(7), 956–977. https://doi.org/10.1080/1369118X.2015.1084349
Boczkowski, P. J. (2021). Abundance: On the experience of living in a world of information plenty. New York, NY: Oxford University Press.
boyd, d. m., & Ellison, N. B. (2007). Social network sites: Definition, history, and scholarship. Journal of Computer-Mediated Communication, 13(1), 210–230. https://doi.org/10.1111/j.1083-6101.2007.00393.x
Bucher, T., & Helmond, A. (2018). The affordances of social media platforms. In J. Burgess, A. Marwick, & T. Poell (Eds.), The Sage handbook of social media (pp. 233–253). London, UK: SAGE Publications. https://doi.org/10.4135/9781473984066.n14
Canella, G. (2018). Video goes vertical: Local news videographers discuss the problems and potential of vertical video. Electronic News, 12(2), 75–93. https://doi.org/10.1177/1931243117705417
Carpenter, J. P., Morrison, S. A., Shelton, C. C., Clark, N., Patel, S., & Toma-Harrold, D. (2024). How and why educators use TikTok: Come for the fun, stay for the learning? Teaching and Teacher Education, 142, Article 104530. https://doi.org/10.1016/j.tate.2024.104530
Carroll, N. (1996). Theorizing the moving image. New York, NY: Cambridge University Press.
Cervi, L., & Divon, T. (2023). Playful activism: Memetic performances of Palestinian resistance in TikTok #challenges. Social Media + Society, 9(1), 1–13. https://doi.org/10.1177/20563051231157607
Cervi, L., Tejedor, S., & Marín Lladó, C. (2021). TikTok and the new language of political communication. Cultura, Lenguaje y Representación, 26, 267–287. https://doi.org/10.6035/clr.5817
Chen, X., Kaye, D. B. V., & Zeng, J. (2021). #PositiveEnergyDouyin: Constructing “playful patriotism” in a Chinese short-video application. Chinese Journal of Communication, 14(1), 97–117. https://doi.org/10.1080/17544750.2020.1761848
Chen, Z., Liu, P., Piao, J., Xu, F., & Li, Y. (2024). Shorter is different: Characterizing the dynamics of short-form video platforms. arXiv. https://doi.org/10.48550/arXiv.2410.16058
Couldry, N., & Hepp, A. (2018). The mediated construction of reality. Cambridge, UK: Polity.
Dijck, J. van. (2013). The culture of connectivity: A critical history of social media. New York, NY: Oxford University Press. https://doi.org/10.1093/acprof:oso/9780199970773.001.0001
Entrena-Serrano, C. (2025). Watch, scroll, repeat: How interface design shapes consumptive curation affordances on TikTok. Social Media + Society, 11(3), 1–16. https://doi.org/10.1177/20563051251358529
Forristal, L. (2025, May 7). Netflix is getting into short videos with a new vertical feed for mobile. TechCrunch. https://techcrunch.com/2025/05/07/netflix-is-getting-into-short-videos-with-a-new-vertical-feed-for-mobile/
Ge, J., Sui, Y., Zhou, X., & Li, G. (2021). Effect of short video ads on sales through social media: The role of advertisement content generators. International Journal of Advertising, 40(6), 870–896. https://doi.org/10.1080/02650487.2020.1848986
Gerlitz, C., & Helmond, A. (2013). The like economy: Social buttons and the data-intensive web. New Media & Society, 15(8), 1348–1365. https://doi.org/10.1177/1461444812472322
Gerlitz, C., Helmond, A., Van der Vlist, F., & Weltevrede, E. (2019). Regramming the platform: Infrastructural relations between apps and social media. Computational Culture, 7. https://computationalculture.net/regramming-the-platform/
Grzenkowicz, M., & Wildfeuer, J. (2025). Addressing TikTok’s multimodal complexity: A multi-level annotation scheme for the audio-visual design of short video content. Digital Scholarship in the Humanities, 40(4), 1143–1166. https://doi.org/10.1093/llc/fqaf047
Guo, P. J., Kim, J., & Rubin, R. (2014). How video production affects student engagement: An empirical study of MOOC videos. In Proceedings of the First ACM Conference on Learning @ Scale Conference (pp. 41–50). New York, NY: Association for Computing Machinery. https://doi.org/10.1145/2556325.2566239
Han, J., & Zappavigna, M. (2024). Multimodal rhythm in TikTok videos: Exploring a recontextualization of the Gillard “misogyny speech.” Multimodality & Society, 4(1), 58–79. https://doi.org/10.1177/26349795231207228
Hern, A. (2015, January 27). Twitter launches video uploads and group DM. The Guardian. https://www.theguardian.com/technology/2015/jan/27/twitter-launches-video-uploads-group-dm
Highfield, T., & Leaver, T. (2016). Instagrammatics and digital methods: Studying visual social media, from selfies and GIFs to memes and emoji. Communication Research and Practice, 2(1), 47–62. https://doi.org/10.1080/22041451.2016.1155332
Instagram Help Center. (2025). Record a reel on Instagram. https://help.instagram.com/2720958398006062
Jancovic, M., Volmar, A., & Schneider, A. (Eds.). (2019). Format matters: An introduction to format studies. Lüneburg, Germany: Meson Press.
Jenkins, H. (2006). Convergence culture. New York, NY: New York University Press.
Kaplan, A. M., & Haenlein, M. (2010). Users of the world, unite! The challenges and opportunities of social media. Business Horizons, 53(1), 59–68. https://doi.org/10.1016/j.bushor.2009.09.003
Kaplunovich, A., & Kaplunovich, S. (2023). Consolidating user data from social networks using machine learning and serverless cloud. In 2023 International Conference on Intelligent Computing, Communication, Networking and Services (ICCNS) (pp. 230–236). Valencia, Spain: IEEE. https://doi.org/10.1109/ICCNS58795.2023.10193182
Kaye, D. B. V., Zeng, J., & Wikstrom, P. (2022). TikTok: Creativity and culture in short video. Cambridge, UK: Polity.
Kemp, S. (2025). Digital 2025: Global overview report. DataReportal. https://datareportal.com/reports/digital-2025-global-overview-report
Khabane, L. S. [@khaby. lame]. (2025, March 25). I got you #learnfromkhaby #comedy [Video]. TikTok. https://www.tiktok.com/@khaby.lame/video/7485735249675177238
Kiddle, R., Welbers, K., Kroon, A., & Trilling, D. (2023). Enabling serendipitous news discovery experiences by designing for navigable surprise. In S. Vrijenhoek, L. Michiels, J. Kruse, A. Starke, J. Viader Guerrero, & N. Tintarev (Eds.), Proceedings of NORMalize 2023: The First Workshop on the Normative Design and Evaluation of Recommender Systems (pp. 1–8). Singapore, China: CEUR-WS. https://ceur-ws.org/Vol-3639/short2.pdf
Kim, T., Temiyasathit, P., & Wang, H. (2021, April 5). How Facebook encodes your videos. Engineering at Meta. https://engineering.fb.com/2021/04/05/video-engineering/how-facebook-encodes-your-videos/
Klug, D. (2020). “It took me almost 30 minutes to practice this.” Performance and production practices in dance challenge videos on TikTok. arXiv. https://doi.org/10.33767/osf.io/j8u9v
Klug, D., & Autenrieth, U. (2022). Struggle for strategy. Presence, practices, and communicative strategies of legacy news providers on TikTok. MediArXiv. https://doi.org/10.33767/osf.io/m3ztq
Lee, Y. J. (2023). Language learning affordances of Instagram and TikTok. Innovation in Language Learning and Teaching, 17(2), 408–423. https://doi.org/10.1080/17501229.2022.2051517
Levanti, L. [@louis_levanti]. (2025, November 7). Grammy nominations and predictions #grammys #music [Video]. Snapchat. https://www.snapchat.com/@louis_levanti/spotlight/W7_EDlXWTBiXAEEniNoMPwAAYdGllZmVraHNvAZpfod1XAZpfoPTxAAAAAQ
Lobinger, K. (2016). Photographs as things—Photographs of things. A texto-material perspective on photo-sharing practices. Information, Communication & Society, 19(4), 475–488. https://doi.org/10.1080/1369118X.2015.1077262
Lobinger, K. (2017). Visual research methods. In J. Matthes, C. S. Davis, & R. F. Potter (Eds.), The international encyclopedia of communication research methods (pp. 1–10). Hoboken, NJ: Wiley. https://doi.org/10.1002/9781118901731
Lupinacci, L. (2021). “Absentmindedly scrolling through nothing”: Liveness and compulsory continuous connectedness in social media. Media, Culture & Society, 43(2), 273–290. https://doi.org/10.1177/0163443720939454
Lupton, D. (2015). Digital sociology. Abingdon, UK: Routledge. https://doi.org/10.4324/9781315776880
Matwick, K., & Matwick, K. (2025). Reel cooking: How Instagram reimagines recipe narratives. Food, Culture & Society, 28(5), 1422–1448. https://doi.org/10.1080/15528014.2025.2462407
McLeod, J. M., & Pan, Z. (2005). Concept explication and theory construction. In S. Dunwoody, L. B. Becker, D. M. McLeod, & G. M. Kosicki (Eds.), The evolution of key mass communication concepts (pp. 14–76). Cresskill, NJ: Hampton Press.
McQuail, D., & Deuze, M. (2020). McQuail’s media & mass communication theory (7th ed.). London, UK: SAGE Publications.
Miller, L. A. (2024). Preserving the ephemeral: A visual typology of augmented reality filters on Instagram. Visual Studies, 40(3), 472–485. https://doi.org/10.1080/1472586X.2024.2341296
Mulier, L., Slabbinck, H., & Vermeir, I. (2021). This way up: The effectiveness of mobile vertical video marketing. Journal of Interactive Marketing, 55(1), 1–15. https://doi.org/10.1016/j.intmar.2020.12.002
Müller, M. G. (2007). What is visual communication? Past and future of an emerging field of communication research. Studies in Communication Sciences, 7(2), 7–34. http://doi.org/10.5169/seals-791077
Muñoz, C. L., & Towner, T. L. (2017). The image is the message: Instagram marketing and the 2016 presidential primary season. Journal of Political Marketing, 16(3–4), 290–318. https://doi.org/10.1080/15377857.2017.1334254
Napoli, M. D. (2016). The “mobile effect” on screen format: The case of vertical videos. Journal of Science and Technology of the Arts, 8(2), 45–49. https://doi.org/10.7559/CITARJ.V8I2.169
Neal, D., & Ross, M. (2018). Mobile framing: Vertical videos from user-generated content to corporate marketing. In M. Schleser & M. Berry (Eds.), Mobile story making in an age of smartphones (pp. 151–160). Cham, Switzerland: Springer International Publishing.
Newman, N., Fletcher, R., Robertson, C. T., Arguedas, A. R., & Nielsen, R. K. (2024). Reuters Institute digital news report 2024. Reuters Institute for the Study of Journalism. https://reutersinstitute.politics.ox.ac.uk/digital-news-report/2024
Nicolaou, C., Matsiola, M., & Kalliris, G. (2019). Technology-enhanced learning and teaching methodologies through audiovisual media. Education Sciences, 9(3), Article 196. https://doi.org/10.3390/educsci9030196
Nielsen, R. K., & Ganter, S. A. (2022). The power of platforms: Shaping media and society. Oxford, UK: Oxford University Press. https://doi.org/10.1093/oso/9780190908850.001.0001
Papacharissi, Z. (2009). The virtual geographies of social networks: A comparative analysis of Facebook, LinkedIn and ASmallWorld. New Media & Society, 11(1–2), 199–220. https://doi.org/10.1177/1461444808099577
Rajendran, P. T., Creusy, K., & Garnes, V. (2024). Shorts on the rise: Assessing the effects of YouTube Shorts on long-form video content. arXiv. https://doi.org/10.48550/arXiv.2402.18208
Rao, L. (2011, June 8). Twitter has begun rolling out new photo service to users. TechCrunch. https://techcrunch.com/2011/06/08/twitter-has-begun-rolling-out-new-photo-service-to-users/
Ravelli, L. J., & Van Leeuwen, T. (2018). Modality in the digital age. Visual Communication, 17(3), 277–297. https://doi.org/10.1177/1470357218764436
Reißmann, W., Siemon, M., & Kinoshita, M. (2023). Image networks and practice analysis of larger data corpora. An approach to cluster and recontextualize visual practice in social media. Studies in Communication Sciences, 24(1), 123–139. https://doi.org/10.24434/j.scoms.2024.01.3883
Rettberg, J. W. (2017). Hand signs for lip-syncing: The emergence of a gestural language on Musical.ly as a video-based equivalent to emoji. Social Media + Society, 3(4), 1–11. https://doi.org/10.1177/2056305117735751
Reviglio, U. (2019). Serendipity as an emerging design principle of the infosphere: Challenges and opportunities. Ethics and Information Technology, 21(2), 151–166. https://doi.org/10.1007/s10676-018-9496-y
Ryan, K. M. (2018). Vertical video: Rupturing the aesthetic paradigm. Visual Communication, 17(2), 245–261. https://doi.org/10.1177/1470357217736660
Salminen, J., Wahid, R., Yang, Y., & Jansen, B. (2024). Engagement patterns in TikTok: An analysis of short video ads. In Proceedings of the 35th ACM Conference on Hypertext and Social Media (pp. 323–329). Poznan, Poland: Association for Computing Machinery. https://doi.org/10.1145/3648188.3677048
Schellewald, A. (2021). Communicative forms on TikTok: Perspectives from digital ethnography. International Journal of Communication, 15, 1437–1457. https://ijoc.org/index.php/ijoc/article/view/16414
Schreiber, M. (2017). Audiences, aesthetics and affordances. Analysing practices of visual communication on social media. Digital Culture and Society, 3(2), 143–163. https://doi.org/10.14361/dcs-2017-0209
Schwartz, B. (2024, April 9). Google tests short videos in search menu bar. Search Engine Roundtable. https://www.seroundtable.com/google-short-videos-in-search-menu-bar-37182.html
Sherman, T. (2021, March 18). Bringing YouTube Shorts to the U.S. YouTube Official Blog. https://blog.youtube/news-and-events/youtube-shorts-united-states/
Shutsko, A. (2020). User-generated short video content in social media. A case study of TikTok. In G. Meiselwitz (Ed.), Social computing and social media. Participation, user experience, consumer experience, and applications of social computing (Vol. 12195, pp. 108–125). Copenhagen, Denmark: Springer International Publishing. https://doi.org/10.1007/978-3-030-49576-3_8
Singh, S. N., & Cole, C. A. (1993). The effects of length, content, and repetition on television commercial effectiveness. Journal of Marketing Research, 30(1), 91–104. https://doi.org/10.1177/002224379303000108
Sinha, D. (2024, September 13). How TikTok works: Decoding system design & architecture with recommendation system. TechAhead. https://www.techaheadcorp.com/blog/decoding-tiktok-system-design-architecture/
Sterne, J. (2012). MP3: The meaning of a format. Durham, NC: Duke University Press. https://doi.org/10.1215/9780822395522
Tedone, D. [@lastoriaperted]. (2025, December 11). Quando la cultura viene “pagata come un hobby,” la società produce “cultura da hobby” [When culture is “paid like a hobby,” society produces “hobbyist culture”] [Video]. Instagram. https://www.instagram.com/p/DSIjmPjDLtC/
TikTok for Business. (2021, December 21). Triple-digit conversion lift: Simple creative tips to make your ads work harder. https://ads.tiktok.com/business/en-US/blog/creative-that-drives-conversions
TikTok Support. (2025). Camera tools. https://support.tiktok.com/en/using-tiktok/creating-videos/camera-tools
Van Leeuwen, T. (1999). Speech, music, sound. Basingstoke, UK: Macmillan. https://doi.org/10.1007/978-1-349-27700-1
Van Leeuwen, T. (2005). Introducing social semiotics. Abington, UK: Routledge.
Vandersmissen, B., Godin, F., Tomar, A., De Neve, W., & Van de Walle, R. (2014). The rise of mobile and social short-form video: An in-depth measurement study of Vine. In Proceedings of the SoMuS ICMR 2014 Workshop (Vol. 1198, pp. 1–10). Glasgow, Scotland: CEUR-WS.
Violot, C., Elmas, T., Bilogrevic, I., & Humbert, M. (2024). Shorts vs. regular videos on YouTube: A comparative analysis of user engagement and content creation trends. arXiv. https://doi.org/10.48550/arXiv.2403.00454
Volmar, A. (2017). Formats as media of cooperation. Media in Action. Interdisciplinary Journal on Cooperative Media, 2, 9–28. https://doi.org/10.25969/mediarep/16226
Wang, X., & Gu, B. (2022). Ethical dimensions of app designs: A case study of photo- and video-editing apps. Journal of Business and Technical Communication, 36(3), 355–400. https://doi.org/10.1177/10506519221087973
Wang, Y. T. (2021). Multimodal analysis: Researching short-form videos and the theatrical practices. In Proceedings of the 104th Association for Education in Journalism and Mass Communication (pp. 12–14). New Orleans, LA: AEJMC. http://hdl.handle.net/10125/76003
Wildfeuer, J. (2014). Film discourse interpretation. New York, NY: Routledge. https://doi.org/10.4324/978020376620
Yang, H., Zhang, S., Diao, Z., & Sun, D. (2023). What motivates users to continue using current short video applications? A dual-path examination of flow experience and cognitive lock-in. Telematics and Informatics, 85, Article 102050. https://doi.org/10.1016/j.tele.2023.102050
Yang, S., Zhao, Y., & Ma, Y. (2019). Analysis of the reasons and development of short video application. Taking TikTok as an example. In 9th International Conference on Information and Social Science (ICISS 2019) (pp. 12–14). Manila, Philippines: Francis Academic Press. https://doi.org/10.25236/iciss.2019.062
Yaqi, Z., Lee, J.-Y., & Liu, S. (2021). Research on the uses and gratifications of TikTok (Douyin short video). International Journal of Contents, 17(1), 37–53. https://doi.org/10.5392/IJoC.2021.17.1.037
YouTube Help. (2025). Get started creating YouTube Shorts. https://support.google.com/youtube/answer/10059070?hl=en
Zamora-Medina, R., Suminas, A., & Fahmy, S. S. (2023). Securing the youth vote: A comparative analysis of digital persuasion on TikTok among political actors. Media and Communication, 11(2), 218–231. https://doi.org/10.17645/mac.v11i2.6348
Zeng, J., & Abidin, C. (2021). “#OkBoomer, time to meet the Zoomers”: Studying the memefication of intergenerational politics on TikTok. Information, Communication & Society, 24(16), 2459–2481. https://doi.org/10.1080/1369118X.2021.1961007
Zeng, J., & Kaye, D. B. V. (2022). From content moderation to visibility moderation: A case study of platform governance on TikTok. Policy & Internet, 14(1), 79–95. https://doi.org/10.1002/poi3.287
Zeng, J., Schäfer, M. S., & Allgaier, J. (2021). Reposting “till Albert Einstein is TikTok famous”: The memetic construction of science on TikTok. International Journal of Communication, 15, 3216–3247.
Zhang, G., Liu, K., Hu, H., & Guo, J. (2021). Short video streaming with data wastage awareness. In 2021 IEEE International Conference on Multimedia and Expo (ICME) (pp. 1–6). Shenzhen, China: IEEE. https://doi.org/10.1109/ICME51207.2021.9428379
Zhang, L., Wang, F., & Liu, J. (2014). Understand instant video clip sharing on mobile platforms: Twitter’s Vine as a case study. In Proceedings of Network and Operating System Support on Digital Audio and Video Workshop (pp. 85–90). New York, NY: Association for Computing Machinery. https://doi.org/10.1145/2578260.2578278
Zhang, W., Mei, J., Song, W., Evans, R., & Xiang, Y. (2021). Why do citizens engage with the TikTok accounts of public hospitals in China? Sage Open, 11(4), Article 21582440211061568. https://doi.org/10.1177/21582440211061568
Zhang, X., Wu, Y., & Liu, S. (2019). Exploring short-form video application addiction: Socio-technical and attachment perspectives. Telematics and Informatics, 42, Article 101243. https://doi.org/10.1016/j.tele.2019.101243
Zhao, H., & Wagner, C. (2023). How TikTok leads users to flow experience: Investigating the effects of technology affordances with user experience level and video length as moderators. Internet Research, 33(2), 820–849. https://doi.org/10.1108/INTR-08-2021-0595
Zibayi, M. [@minazibayi]. (2025, April 13). Make-up tutorial [Video]. YouTube. https://www.youtube.com/shorts/d3x3Y8OawO0
Zulli, D., & Zulli, D. J. (2022). Extending the Internet meme: Conceptualizing technological mimesis and imitation publics on the TikTok platform. New Media & Society, 24(8), 1872–1890. https://doi.org/10.1177/1461444820983603
Copyright © 2026 (Giulia Isabella Guerra).
Licensed under the Creative Commons Attribution Non-commercial No Derivatives (by-nc-nd).
https://doi.org/10.65476/rdgsy150