Get User Media for OBS

Choose the media (Video, Audio, Midi or Gamepad) to send to the OBS Browser

Connect to the OBS WebSocket Server

Choose an Audio and/or Video Source




Audio Frequency Analysis


Send Audio and/or Video to a target OBS Browser source via webRTC

Midi

Gamepad

https://github.com/luser/gamepadtest

Press a button on your controller to start

WebSpeech API Speech Recognition

MediaPipe Pose Detection

MediaPipe Hand Landmark

MediaPipe Face Landmark Dectection

MediaPipe Image Segmentation

MediaPipe Text Classification

Classifying text with the MediaPipe Text Classifier Task

This demo listens for event {speechToText}, then runs sentiment analysis. The result shows how likely the input text is to have a positive or negative sentiment.

MediaPipe Gemma LLM Inference

Text-to-Text large language model

Gemma 2B is a part of a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models. The model contains 2B parameters and open weights. This model is well-suited for a variety of text generation tasks, including question answering, summarization, and reasoning. https://developers.google.com/mediapipe/solutions/genai/llm_inference
Gemma formatting and system instructions
Input:
example format
<start_of_turn>user
Some question?<end_of_turn>
<start_of_turn>model




Result:

Apple Shortcuts Input

Mouse coordinates

PTZ coordinates