Skip to main content
Video Demo

AI-Powered Video Recording

Screen capture, camera recording, PiP compositing, noise reduction, and AI transcription with word-level timestamps.

Screen Recording

Capture your screen with audio

Camera Capture

Record from webcam at 1080p

Picture-in-Picture

Screen + camera composite

Audio Only

Record with noise reduction

Preview

Click Record to start

Recording Settings
Resolution: 1920x1080
FPS: 30
Audio: 48kHz

AI Transcription

Start recording to see AI transcription

Powered by OpenAI Whisper

ScreenRecorder

Full-screen capture with customizable bitrate, audio mixing, and quality settings.

PiPCompositor

Picture-in-Picture composite combining screen and camera feeds.

NoiseProcessor

Real-time audio denoising with echo cancellation and auto-gain control.

Transcription

AI-powered transcription via OpenAI Whisper with word-level timestamps.

TextFormatter

Clean transcribed text — removes filler words, fixes punctuation and capitalization.

InstrumentedRecorder

Built-in metrics, correlation IDs, and performance tracking for analytics.

How It Works

AI Kit's video package provides a complete recording pipeline that handles screen capture, camera recording, and audio processing through a unified API. The ScreenRecorder uses the MediaStream API to capture your display at configurable resolution and bitrate, while CameraRecorder provides webcam access with auto-exposure and white balance. The PiPCompositor combines both feeds into a single output, placing the camera feed as an overlay in any corner of the screen recording.

Audio processing runs through the NoiseProcessor, which applies real-time echo cancellation, noise suppression, and auto-gain control using the Web Audio API. Once recording is complete, the Transcription module sends the audio to OpenAI Whisper for AI-powered transcription with word-level timestamps. The TextFormatter post-processes the transcript to remove filler words, fix punctuation, and normalize capitalization. The HighlightDetector identifies key moments in the transcript for automatic chapter generation. All components emit instrumented events for analytics integration.

Use Cases

  • Developer tutorials with screen + webcam PiP recording and auto-generated subtitles from Whisper transcription.
  • Customer support recordings with noise-reduced audio and searchable transcripts for quality review.
  • Meeting documentation with automatic speaker detection, timestamped notes, and highlight extraction.
  • Async video messaging with camera recording, real-time audio denoising, and instant text transcription.

Integration Guide

Record your screen and transcribe with AI in a few lines:

import { ScreenRecorder, transcribeAudio } from '@ainative/ai-kit-video';

const recorder = new ScreenRecorder({
  resolution: { width: 1920, height: 1080 },
  frameRate: 30,
  audioBitrate: 128000,
  noiseReduction: true,
});

// Start recording
await recorder.start();

// Stop and get the recording
const blob = await recorder.stop();

// Transcribe with AI
const transcript = await transcribeAudio(blob, {
  model: 'whisper-1',
  language: 'en',
  timestamps: 'word',
});

console.log(transcript.segments);
// [{ text: "Hello world", start: 0.0, end: 1.2 }, ...]