AccueilClientsExpertisesOpen SourceBlogContactEstimer

2 avril 2025

Document Picture-in-Picture, Pip for any HTML content

6 minutes de lecture

Document Picture-in-Picture, Pip for any HTML content
🇫🇷 This post is also available in french

Picture-in-Picture mode is mostly used to detach a video player into a floating window that is constantly visible and on top, while continuing to browse a page.

Example of Picture-in-Picture in video mode
Example of Picture-in-Picture in video mode

But, in this article, I want to talk about another similar API: The Document Picture-in-Picture (PiP) API.

It takes this idea of a floating window a bit further. Instead of being limited to videos, it allows you to display any HTML document in a PiP window. It allows for the creation of innovative user experiences and new ways of designing interfaces.

In this article, we will see how to use this API before exploring innovative use cases that leverage its potential.

This API is only supported by Chrome, starting from version 116. If needed, there's a flag to enable/disable this feature chrome://flags/#document-picture-in-picture-api.

How the API Works

The API relies on a global object, documentPictureInPicture, which offers several entry points to interact with a Picture-in-Picture window dedicated to any HTML content. Here are the main features:

Main Property

documentPictureInPicture.window

This property returns the active PiP window if it exists, otherwise it returns null. It allows direct access to the window content to, for example, interact with elements or change its appearance.

Method to Open a Picture-in-Picture Window

documentPictureInPicture.requestWindow(options)

This method is at the center of the API. It opens a new PiP window and returns a promise that resolves into an object representing this window.

The options object can include several parameters to customize the window:

  • width and height: They set the initial size of the PiP window.
  • disallowReturnToOpener: By setting this option to true, you hide the button to go back to the original tab. Its default value is false.
  • preferInitialWindowPlacement: If set to true, the window opens in its default position and size, ignoring the last configuration used.

The call must be initiated as a result of user interaction (click, key press, etc.). Otherwise, the promise is rejected.

Once a PiP window is instantiated, you can handle it like any other window object.

Usage Example in JavaScript

First of all, given that the API is not supported by all browsers, it's necessary to verify if it's supported before using it.

if ('documentPictureInPicture' in window) {
  // API is supported
}

Now that we know the API is supported, to display HTML content in a PiP window, simply retrieve the HTML content and add it to the PiP window.

pipButton.addEventListener("click", async () => {
  const myContent = document.querySelector("#content"); 

  const pipWindow = await documentPictureInPicture.requestWindow({
    width: 500,
    height: 500,
  });

  pipWindow.document.body.append(myContent);
});

With a user click, a floating window will be visible (of size 500px x 500px) and will contain the HTML content of the #content element.

Upon creating the window, the PiP document of the latter will be empty. So, of course, you'll need to add your HTML content as we did but also, if needed, add style (same thing for events, you'll have to add them to the new PiP window).

The simplest way is to copy the style of the originating document entirely. For that, you can use the following function:

function copyStyles(pipWindow) {
    [...document.styleSheets].forEach((styleSheet) => {
      try {
        const cssRules = [...styleSheet.cssRules]
          .map((rule) => rule.cssText)
          .join('');
        const style = document.createElement('style');

        style.textContent = cssRules;
        pipWindow.document.head.appendChild(style);
      } catch (e) {
        const link = document.createElement('link');

        link.rel = 'stylesheet';
        link.type = styleSheet.type;
        link.media = styleSheet.media.mediaText;
        link.href = styleSheet.href || '';
        pipWindow.document.head.appendChild(link);
      }
    });
}

To copy all CSS style sheets from the original window, you have to manually go through the styleSheets connected or integrated into the document, and then add them to the Picture-in-Picture window. Be careful, this copy is only done once, the styles will not be updated automatically afterward.

The copyStyleSheets method was supported by Chrome in an earlier version of the API specification but has been removed.

And Voilà, we now have a PiP window that displays HTML content and is styled like the rest of the document.

Document Picture-in-Picture example
Document Picture-in-Picture example

Of course, it is possible to go further by finely customizing the operation and content of the Picture-in-Picture window.

You can listen to/detect its opening, its closing, interact with its content, react via standard HTML document events, etc.

In terms of style, you can also specify style rules that target only the PiP window. Handy for isolating specific style rules for the PiP document without polluting the main document:

@media all and (display-mode: picture-in-picture) {
  body {
    background-color: #000;
    color: #fff;
  }
  h1 {
    font-size: 0.8em;
  }
}

Obviously, there is a React wrapper for this API. But here I wanted to present the API in its native form.

User Interface Use Cases

In this section, I have gathered some concrete and visual use cases to illustrate how this API can improve the user experience, streamline certain interactions, or simply offer a gain in comfort and productivity.

One of the most advanced uses of the Document Picture-in-Picture API to my knowledge is undoubtedly that of Google Meet.

When you detach a meeting via PiP mode, you find a complete and interactive interface in an independent window. You can see participants, enable/disable your microphone, leave the meeting, follow a presentation, raise your hand. Almost all the essential features of a meeting are available in this compact version.

Example of the use of the Document Picture-in-Picture by Google Meet
Example of the use of the Document Picture-in-Picture by Google Meet

It's an excellent example of what this API allows: stay active in a meeting while pursuing another task, without having to switch between tabs or windows. This makes the experience much more fluid, especially in a multitasking context.

From there, we can imagine all sorts of declinations of usage in application components.

Here are a few other concrete illustrated use cases which show how this API can transform the user interface in different contexts.

Interactive media player

Here, I designed a small example of a Soundcloud audio player that would display user comments. As well as, a Twitch live where you could comment live.

GIF of a media player with interactions

Online text editor

In a web application type code editor with preview, you could display the preview in a floating window. So users with two monitors can take full advantage of the editing interface.

Here I took an example available on the Chrome For Developers website.

Example of a text editor with a floating window for the preview
Example of a text editor with a floating window for the preview

Documentation in floating window

Documentation could incorporate a floating window to have important information readily available. And avoid going back and forth between the documentation and the code.

An example of a useQuery documentation in a floating window
An example of a useQuery documentation in a floating window

Progress of a build

Sometimes, you may need to monitor the progress of a build, or any process that may take time.

With a floating window, you can keep an eye on the progress of this process while continuing to work on another task.

Example of a floating window for a build progress
Example of a floating window for a build progress

API Limitations

Although this API is very interesting, it still has some significant limitations to know.

Limited Support

First, its support is currently limited to Google Chrome and Chromium-based browsers.

No Navigation Possible

Next, it's important to note that the PiP window does not allow navigation between different pages or redirection. It's an isolated document, linked to the environment of the main page. The goal is to extend a component of your page, not to create a new autonomous instance of the application.

Only One Floating Window per Tab

For obvious reasons, the browser limits the display to one PiP window at a time (per open tab).

Manual Management of Styles and Scripts

Finally, the style and script system in the PiP window requires manual management. There is no automatic synchronization with the main document, which can represent an additional maintenance effort.

If you want to quickly test Picture-in-Picture behavior on a website. I recommend this Chrome extension which allows you to activate PiP mode on any element of a web page.

Conclusion

The Document Picture-in-Picture API opens the way to more interactive and useful floating interfaces, well beyond simple video. Despite being currently limited to Chrome, it already offers great opportunities for improving the user experience, especially in SaaS contexts or productivity tools. An API to watch and experiment with today.

Resources

À découvrir également

Bun : un outil tout-en-un JavaScript révolutionnaire ?

28 Jan 2025

Bun : un outil tout-en-un JavaScript révolutionnaire ?

Bun est un runtime (et plus encore) JavaScript dont l'axe de communication principal est sa performance...

par

Hugo

Créer un indicateur d'activité audio dans le navigateur

02 Aug 2024

Créer un indicateur d'activité audio dans le navigateur

Dans cet article, nous allons voir comment créer un indicateur d'activité audio dans le navigateur, pour afficher une animation lorsqu'un son est joué. Très utile pour les applications de chat video ou de streaming audio.

par

Quentin

AI et UI #1 - Filtres intelligents avec le SDK Vercel AI et Next.js

12 Jun 2024

AI et UI #1 - Filtres intelligents avec le SDK Vercel AI et Next.js

Dans ce premier article d’une série consacrée à l’IA et l’UI, je vous propose de découvrir différentes manières d’intégrer ces modèles d’IA dans vos applications React pour améliorer l’expérience utilisateur.

par

Baptiste

Premier Octet vous accompagne dans le développement de vos projets avec react

En savoir plusNous contacter
18 avenue Parmentier
75011 Paris
+33 1 43 57 39 11
hello@premieroctet.com

Suivez nos aventures

GitHub
X
Flux RSS

Naviguez à vue