Auth header

x-api-key: YOUR_API_KEY

Content-Type

application/json

Best for

Server-to-server integrations, automation workflows, internal dashboards, partner apps

Prerequisites

Valid API plan · API key issued per workspace · HTTPS requests only · files must be under 10 MB

NOTE:POST /api/v2/videos/generate requires x-api-key. Returns 403 with the message “API v2 requires API key authentication. Provide ‘x-api-key’ header.” if the header is missing.

Your API base URL is provided when you create an API key in the dashboard. All requests must be made over HTTPS.

POST/api/v2/videos/generate

Generate a video from a prompt or script. JSON body, x-api-key required.

generate.js

import fetch from "node-fetch";

const API_KEY = "api-key"; // Replace with your Golpo API key

const payload = {
  prompt: "Explain the quarterly roadmap",
  background_track: "engaging"
};

const BASE_URL = "{BASE_URL}"; // You will receive this when creating an API key
const response = await fetch(`${BASE_URL}/api/v2/videos/generate`, {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    "x-api-key": API_KEY
  },
  body: JSON.stringify(payload)
});

const data = await response.json();
console.log(data);

Run with: node --env-file=.env generate.js

GET/api/v2/videos

Retrieve videos belonging to the authenticated user. Supports limit and offset query params. x-api-key required.

list-videos.js

import fetch from "node-fetch";

const BASE_URL = "{BASE_URL}"; // You will receive this when creating an API key

// Query params:
//   limit  — max number of videos to return (optional)
//   offset — number of videos to skip (optional, for pagination)
const response = await fetch(`${BASE_URL}/api/v2/videos?limit=10&offset=0`, {
  headers: {
    "x-api-key": "api-key"
  }
});

const data = await response.json();
console.log(data.videos); // array of video objects
console.log(data.total);  // total number of videos

Run with: node --env-file=.env list-videos.js

GET/api/v2/videos/{video_id}

Retrieve metadata for a single video by ID. x-api-key required.

get-video.js

const BASE_URL = "{BASE_URL}"; // You will receive this when creating an API key
const VIDEO_ID = "video_id";

const response = await fetch(`${BASE_URL}/api/v2/videos/${VIDEO_ID}`, {
  headers: { "x-api-key": "api-key" }
});
console.log(await response.json());

Run with: node --env-file=.env get-video.js <video_id>

PATCH/api/v2/videos/{video_id}

Update video metadata. Only 'title' and 'visibility' can be updated.

update-video.js

const BASE_URL = "{BASE_URL}"; // You will receive this when creating an API key
const VIDEO_ID = "video_id";

const payload = { title: "Updated internal briefing", visibility: "private" };

const response = await fetch(`${BASE_URL}/api/v2/videos/${VIDEO_ID}`, {
  method: "PATCH",
  headers: {
    "Content-Type": "application/json",
    "x-api-key": "api-key"
  },
  body: JSON.stringify(payload)
});

console.log(await response.json());

Run with: node --env-file=.env update-video.js <video_id>

DELETE/api/v2/videos/{video_id}

Delete a generated video (soft delete). x-api-key required.

delete-video.js

const BASE_URL = "{BASE_URL}"; // You will receive this when creating an API key
const VIDEO_ID = "video_id";

const response = await fetch(`${BASE_URL}/api/v2/videos/${VIDEO_ID}`, {
  method: "DELETE",
  headers: { "x-api-key": "api-key" }
});

console.log(await response.json());

Run with: node --env-file=.env delete-video.js <video_id>

Only prompt is required. All other parameters are optional.

Parameter	Type	Default	Required	Description
prompt	string	—	Required	Main prompt/topic for the video.
reference_source	array[string]	None	Optional	List of document URLs (https/s3) used as reference material. Use /upload-file to obtain URLs; server-side callers may also pass local file paths.
narration_instructions	string	""	Optional	Describe how the narration voice should sound — accent, tone, pace, and delivery style. Examples: "Speak in a warm British accent with a calm, professorial tone", "Fast-paced sports commentator style".
enable_script_only_mode	boolean	false	Optional	Only generate the script — skip TTS and video rendering. Status response returns script_text. NOTE:This parameter will not work with own_narration_source (own-narration mode). Requests will follow the standard own_narration workflow regardless of the parameter value.
custom_script	string	None	Optional	Use a script you supply instead of generating one. Capped at approximately 15 estimated minutes. NOTE:Scripts longer than ~15 estimated minutes are rejected with 422. If timing is also set, it must be greater than or equal to the estimated script duration. Embedding paired [START][IMAGE N]...[END][IMAGE N] and [START][VIDEO N]...[END][VIDEO N] markers in your script controls where your uploaded custom_images and custom_videos appear. See the Inserting Images in Script and Inserting Videos in Script sections for the placement rules — markers that violate the format are silently dropped.
narration_voice	string	"female-1"	Optional	Narration voice. Allowed values: "female-1" (default), "female-2", "male-1", "male-2".
enable_color	boolean	true	Optional	Enable color video generation pipeline.
narration_language	string	"en"	Optional	Language for narration. Lowercase only — accepts language keywords (e.g. "english", "hindi"), ISO 639-1 codes (e.g. "en", "hi"), or code-switch variants (e.g. "hi-en").
background_track	string	None	Optional	Background music track. Allowed values: "jazz", "lofi", "dramatic", "engaging", "hyper", "inspirational", "documentary".
video_orientation	string	"horizontal"	Optional	Video orientation: "horizontal" (16:9) or "vertical" (9:16).
enable_podcast_engine	boolean	false	Optional	When true, generate audio-only podcast output (no video). Status response returns podcast_url.
timing	string	"1"	Optional	Video/podcast duration in minutes (as string) or "auto" to let the system determine it. Supported values: "0.25", "0.5", "1", "2", "4", "8", "10", "15", "auto". NOTE:"15" is only available on the Enterprise Custom plan. Other plans receive a 403 when requesting 15-minute videos. Contact support@golpoai.com to upgrade.
watermark	boolean	false	Optional	Include a watermark in the video.
custom_logo	string	None	Optional	Custom logo URL (https/s3). Use /upload-file first; server-side callers may also pass local file paths.
logo_position	string	None	Optional	Logo placement. Allowed values: "tl" (top-left), "tr" (top-right), "bl" (bottom-left), "br" (bottom-right). NOTE:Only valid when watermark=true or custom_logo is provided. Setting logo_position without either is rejected with 422, since no logo or watermark would render.
visual_instructions	string	""	Optional	Visual instructions to guide how the video looks. Examples: "Show more graphs and charts", "Cinematic transitions", "Dark moody aesthetic with neon accents".
golpo_video_engine	string	"golpo_canvas"	Optional	Video generation engine: "golpo_canvas" (default) or "golpo_sketch". NOTE:When using golpo_canvas, canvas_style_variant is required. When using golpo_sketch, sketch_style_variant is required. Setting either to null is rejected with 422.
canvas_style_variant	string	"chalkboard_color"	Optional	Golpo Canvas style variant. Allowed values: "chalkboard_bw" (white chalk on black), "chalkboard_black_on_white" (black ink on white), "chalkboard_color" (default), "whiteboard", "modern_minimal", "technical", "sharpie", "playful", "editorial", "illustrations". Required when golpo_video_engine is "golpo_canvas".
sketch_style_variant	string	"classic"	Optional	Golpo Sketch style variant. Allowed values: "classic" (default), "improved(beta)", "formal", "crayon", "dry_erase", "professional_clean", "creative", "infographics", "chalkboard_black_on_white". Required when golpo_video_engine is "golpo_sketch".
own_narration_source	string	None	Optional	URL (https/s3) to an audio or video file to use instead of generating script and TTS. Server-side callers may also pass local file paths. Useful for generating a video from your own existing audio or video narration.
use_ai_audio_at	array[integer]	None	Optional	List of 1-indexed positions in custom_videos whose original audio should be replaced with AI-generated narration. Videos not listed keep their original audio. NOTE:Only valid when custom_videos is provided. Length must not exceed the number of custom_videos.
custom_images	array[string]	None	Optional	List of image URLs (https/s3) to insert into the video. Server-side callers may also pass local file paths.
custom_images_description	array[string]	None	Optional	List of descriptions for custom images (one per image). NOTE:Required when custom_images is provided. Must have the same length as custom_images. Descriptions tell the Golpo AI what each image shows, so it can place each image at the right moment in the script and reference it in the narration. More specific descriptions produce more contextually accurate videos.
keep_original_images	array[boolean]	None	Optional	List of booleans indicating whether to use each image as-is without AI processing (one per image). NOTE:Only valid when custom_images is provided. Length must not exceed the number of custom_images.
disable_image_animation	array[boolean]	None	Optional	List of booleans indicating whether to skip animation for each image (one per image). NOTE:Only valid when custom_images is provided. Length must not exceed the number of custom_images.
custom_videos	array[string]	None	Optional	List of video URLs (https/s3) to insert into the video. Server-side callers may also pass local file paths.
custom_videos_description	array[string]	None	Optional	List of descriptions for custom videos (one per video). NOTE:Required when custom_videos is provided. Must have the same length as custom_videos. Descriptions tell the Golpo AI what each video shows, so it can cut to each clip at the right moment in the script and frame the surrounding narration. More specific descriptions produce more contextually accurate videos.
visibility	string	None	Optional	Video visibility. Allowed values: "public" (appears in the Golpo gallery) or "private".
video_id	string	None	Optional	Optional custom video/job id assigned by the caller.
reference_images	array[string]	None	Optional	List of reference image URLs (https/s3) for problem-solving mode. The video is generated around the provided images. Server-side callers may also pass local file paths. Best suited for explaining math, science, or any problem captured as an image — pair with timing: "auto" so the Golpo AI picks a duration that matches the complexity of the problem.
pen_animation_style	string	None	Optional	Pen cursor animation style. Allowed values: "stylus", "marker", "pen". NOTE:Only supported with the Golpo Canvas engine. Setting this field with "golpo_video_engine": "golpo_sketch" is rejected with 422.
own_narration_video_mode	boolean	None	Optional	When true, the user's own narration video is overlaid as a picture-in-picture thumbnail. Requires own_narration_source to point to a video file. NOTE:Setting own_narration_video_mode=true with a non-video own_narration_source returns 422.
own_narration_video_position	string	"left-top"	Optional	Corner for the picture-in-picture overlay. Allowed values: "left-top" (default), "right-top", "left-bottom", "right-bottom".
scene_pacing	string	"normal"	Optional	Pacing level: "normal" (15s max per scene) or "fast" (10s max per scene). Works with both Golpo Canvas and Golpo Sketch.
onscreen_text_language	string	None	Optional	Language for on-screen text. Same value space as narration_language. Useful when displayed text should differ from the narration language. NOTE:Only supported with the Golpo Canvas engine. Setting this field with "golpo_video_engine": "golpo_sketch" is rejected with 422. If omitted, on-screen text uses the same language as narration_language. Example: narration_language: "hindi", onscreen_text_language: "english" — narration in Hindi, on-screen text in English.
language_variants	array[string]	None	Optional	Generate additional language versions of the video alongside the primary `narration_language`. Provide an array of supported languages (e.g. ["spanish", "french", "hi"]). A separate video will be generated for each specified language. Each language version is billed as a separate render. For example, a 2-minute video with 2 additional languages is billed as 3 renders × 2 minutes. Duplicate languages, including the primary narration_language, are not allowed and will return a 400 error. NOTE:All language variants use the same prompt, duration, voice, and visual settings. Only the narration_language changes for each variant.
variant_visuals_mode	string	"shared"	Optional	Controls how visuals are generated across language variants. `shared` (default) reuses the primary video's visuals across all language variants. `per_language` generates separate visuals for each language variant.

Content

promptcustom_scriptenable_script_only_modereference_sourcereference_images

Narration

narration_voicenarration_instructionsnarration_languageonscreen_text_languagelanguage_variantsvariant_visuals_modeown_narration_sourceown_narration_video_modeown_narration_video_position

Visual

golpo_video_enginecanvas_style_variantsketch_style_variantenable_colorvisual_instructionspen_animation_stylescene_pacing

User Media

custom_imagescustom_images_descriptionkeep_original_imagesdisable_image_animationcustom_videoscustom_videos_descriptionuse_ai_audio_at

Audio

background_track

Format

timingvideo_orientationenable_podcast_engine

Branding

watermarkcustom_logologo_position

Workflow

visibilityvideo_id

NOTE:File size must be less than 10 MB. Larger files are rejected with HTTP 413. The upload endpoint is gated to Business Plus plans.

NOTE:Document URLs are single-use. When you pass a document URL (PDF, DOCX, TXT, etc.) from /upload-file into reference_source, the file is permanently deleted from storage after its text is extracted during generation. The same URL cannot be reused for another video or podcast — you must upload the document again via /upload-file to get a fresh URL for each new generation.

Content-Type:multipart/form-data

Field	Type	Required	Description
file	File	Yes	Single file to upload (file metadata is used to generate a presigned URL).

Documents

PDF · DOC · DOCX · TXT · MD · CSV · XLSX · XLS · PPT · PPTX

Audio

MP3 · WAV · M4A · OGG · FLAC

Video

MP4 · MOV · M4V · AVI · MKV · WEBM

Images

JPG · JPEG · PNG · GIF · WEBP

upload-response.json

// Response from POST /api/v2/videos/upload-file
{
  "upload_url":   "https://...",  // Presigned PUT URL — upload the file here
  "file_url":     "https://...",  // Pass this into /generate
  "file_name":    "document.pdf",
  "content_type": "application/pdf"
}

upload-file.js

const API_KEY = "api-key";
const BASE_URL = "{BASE_URL}";

// Step 1: Get presigned URL
async function uploadFile(file) {
  const formData = new FormData();
  formData.append('file', file);

  const response = await fetch(`${BASE_URL}/api/v2/videos/upload-file`, {
    method: 'POST',
    headers: { 'x-api-key': API_KEY },
    body: formData
  });

  const { upload_url, file_url, content_type } = await response.json();

  // Step 2: Upload bytes to the presigned URL
  await fetch(upload_url, {
    method: 'PUT',
    headers: { 'Content-Type': content_type },
    body: file
  });

  // Step 3: Return file_url for use in /generate
  return file_url;
}

// Usage
const fileInput = document.querySelector('#file-input');
const fileUrl = await uploadFile(fileInput.files[0]);
console.log('File uploaded, URL:', fileUrl);
// Use fileUrl in your /generate request (reference_source, custom_logo, etc.)

NOTE:custom_images and custom_videos accept the file_url returned by /upload-file. Server-side callers may pass local file paths directly. Description arrays (custom_images_description, custom_videos_description) are required and must match the length of their image/video arrays.

generate-with-user-assets.js

const API_KEY = "api-key";
const BASE_URL = "{BASE_URL}";

async function uploadFile(file) {
  const formData = new FormData();
  formData.append('file', file);

  const presign = await fetch(`${BASE_URL}/api/v2/videos/upload-file`, {
    method: 'POST',
    headers: { 'x-api-key': API_KEY },
    body: formData
  });
  const { upload_url, file_url, content_type } = await presign.json();

  await fetch(upload_url, {
    method: 'PUT',
    headers: { 'Content-Type': content_type },
    body: file
  });

  return file_url;
}

// Step 1: Upload images and videos
const image1Url = await uploadFile(imageFile1);
const image2Url = await uploadFile(imageFile2);
const video1Url = await uploadFile(videoFile1);
const video2Url = await uploadFile(videoFile2);

// Step 2: Generate with custom images and videos
const generatePayload = {
  prompt: "Create a product showcase video",
  custom_images: [image1Url, image2Url],
  custom_images_description: [
    "Product front view with logo",
    "Product in use by customer"
  ],
  keep_original_images: [false, false],
  disable_image_animation: [false, true],
  custom_videos: [video1Url, video2Url],
  custom_videos_description: [
    "Customer testimonial",
    "Product walkthrough"
  ],
  // 1-indexed positions in custom_videos whose original audio
  // should be replaced with AI-generated narration.
  use_ai_audio_at: [2],
  video_orientation: "horizontal",
  watermark: false
};

const response = await fetch(`${BASE_URL}/api/v2/videos/generate`, {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    'x-api-key': API_KEY
  },
  body: JSON.stringify(generatePayload)
});

const result = await response.json();
console.log('Job ID:', result.job_id);

NOTE:Local paths are only supported in server-side environments where the API server has filesystem access. For remote clients, upload via /upload-file first.

generate-with-local-paths.js

import fetch from "node-fetch";

const API_KEY = "api-key";
const BASE_URL = "{BASE_URL}";

// Server-side only: when the API server has filesystem access
// to these files, it uploads them transparently before generation.
const payload = {
  prompt: "Create a comprehensive product demo video",
  reference_source: [
    "./documents/product-spec.pdf",
    "C:\\Users\\Documents\\brief.docx",
    "/home/user/documents/slides.pptx"
  ],
  custom_logo: "./assets/company-logo.png",
  custom_images: [
    "./images/product-front.jpg",
    "./images/product-in-use.jpg"
  ],
  custom_images_description: [
    "Product front view with branding",
    "Product being used by customer"
  ],
  keep_original_images: [false, false],
  disable_image_animation: [false, true],
  own_narration_source: "./audio/founder-narration.mp3",
  narration_voice: "female-1",
  background_track: "engaging",
  video_orientation: "horizontal",
  watermark: false,
  enable_color: true,
  narration_language: "en",
  timing: "5"
};

const response = await fetch(`${BASE_URL}/api/v2/videos/generate`, {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    "x-api-key": API_KEY
  },
  body: JSON.stringify(payload)
});

const data = await response.json();
console.log("Job ID:", data.job_id);

Local paths only work when the API server can access these files. Remote clients must upload via /upload-file first.

Upload Workflow

Full pattern: upload supporting files via /upload-file, then call /generate with the returned URLs in reference_source.

generateVideoWithUploads.js

const API_KEY = "api-key";
const BASE_URL = "{BASE_URL}";

// Step 1: Helper to upload a file via /upload-file
async function uploadFile(file) {
  const formData = new FormData();
  formData.append('file', file);

  const presign = await fetch(`${BASE_URL}/api/v2/videos/upload-file`, {
    method: 'POST',
    headers: { 'x-api-key': API_KEY },
    body: formData
  });
  const { upload_url, file_url, content_type } = await presign.json();

  await fetch(upload_url, {
    method: 'PUT',
    headers: { 'Content-Type': content_type },
    body: file
  });

  return file_url;
}

// Step 2: Upload all reference docs and collect URLs
const fileInput = document.querySelector('#uploads');
const referenceUrls = await Promise.all(
  Array.from(fileInput.files).map((f) => uploadFile(f))
);

// Step 3: Generate the video with the uploaded references
const generatePayload = {
  prompt: 'Summarize this slide deck for executives',
  reference_source: referenceUrls,
  narration_voice: 'female-1',
  narration_language: 'spanish'
};

const response = await fetch(`${BASE_URL}/api/v2/videos/generate`, {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    'x-api-key': API_KEY
  },
  body: JSON.stringify(generatePayload)
});

const result = await response.json();
console.log('Generation started:', result);

Attach files via <input id="uploads" type="file" multiple /> before calling this helper.

Golpo Sketch

golpo_video_engine: golpo_sketch

Whiteboard-style sketch animation. Pick a sketch_style_variant and optionally tune scene_pacing (normal/fast).

golpo-sketch.js

import fetch from "node-fetch";

const API_KEY = "api-key";
const BASE_URL = "{BASE_URL}";

const payload = {
  prompt: "Explain how neural networks work",
  golpo_video_engine: "golpo_sketch",
  // sketch_style_variant options:
  //   "classic" (default), "improved(beta)", "formal",
  //   "crayon", "dry_erase", "professional_clean",
  //   "creative", "infographics"
  sketch_style_variant: "improved(beta)",
  scene_pacing: "fast",              // "normal" (15s max) | "fast" (10s max)
  background_track: "engaging",
  watermark: false
};

const response = await fetch(`${BASE_URL}/api/v2/videos/generate`, {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    "x-api-key": API_KEY
  },
  body: JSON.stringify(payload)
});

console.log(await response.json());

Run with: node --env-file=.env golpo-sketch.js

Golpo Canvas

golpo_video_engine: golpo_canvas

Canvas-based video with rich style variants (chalkboard, whiteboard, editorial…) plus pen_animation_style and scene_pacing.

golpo-canvas.js

import fetch from "node-fetch";

const API_KEY = "api-key";
const BASE_URL = "{BASE_URL}";

const payload = {
  prompt: "Create a product overview video",
  golpo_video_engine: "golpo_canvas",
  // canvas_style_variant options:
  //   "chalkboard_bw", "chalkboard_color" (default), "whiteboard",
  //   "modern_minimal", "technical", "sharpie", "playful", "editorial",
  //   "illustrations"
  canvas_style_variant: "modern_minimal",
  pen_animation_style: "stylus",       // "stylus" | "marker" | "pen"
  scene_pacing: "normal",              // "normal" (15s max) | "fast" (10s max)
  watermark: false
};

const response = await fetch(`${BASE_URL}/api/v2/videos/generate`, {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    "x-api-key": API_KEY
  },
  body: JSON.stringify(payload)
});

console.log(await response.json());

Run with: node --env-file=.env golpo-canvas.js

Display Language

onscreen_text_language

Generate a video where the narration and the on-screen text are in different languages — e.g. Hindi voice-over with English text displayed in the video. Only supported with the Golpo Canvas engine; setting onscreen_text_language with golpo_video_engine: golpo_sketch is rejected with 422.

display-language.js

import fetch from "node-fetch";

const API_KEY = "api-key";
const BASE_URL = "{BASE_URL}";

// Narrator speaks Hindi; on-screen text, titles, and labels are in English.
const payload = {
  prompt: "Explain the water cycle",
  narration_language: "hindi",
  onscreen_text_language: "english",
  narration_voice: "female-1",
  watermark: false
};

const response = await fetch(`${BASE_URL}/api/v2/videos/generate`, {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    "x-api-key": API_KEY
  },
  body: JSON.stringify(payload)
});

console.log(await response.json());

Run with: node --env-file=.env display-language.js

The table below shows the maximum number of languages you can generate per request for each timing value — both the total (primary + variants) and how many you can list in language_variants.

timing	Max languagesprimary + variants	Max language_variants
0.250.51	10	9
2	5	4
4	3	2
81015	2	1
auto	2	1

2–10

Max languages

depends on timing (≤1 min: 10; 2 min: 5; 4 min: 3; ≥8 min: 2)

API request

generates a separate render for each provided language_variants value

2 modes

Visuals

shared (default) reuses the primary's visuals across every language · per_language regenerates visuals separately for each language

✓

shared (default)

Same visuals for every variant — only the audio changes per language.

• All variants use the primary video's visuals

↻

per_language

Generate separate visuals for each language variant, with on-screen text in that language.

• Podcasts (enable_podcast_engine: true) always use this mode, since there are no visuals to share

Variant status lifecycle

queued_for_variantgeneratingcompleted/failedPoll GET /api/v2/videos/status/{job_id}

NOTE:Billing. Each variant bills as a full render — a 2-minute primary + 2 extra languages bills as 3 × 2-minute renders. Duplicate languages or duplicates of the primary are rejected with 400.

Example payload for each visuals mode

shared.json

{
  "prompt": "Explain photosynthesis",
  "narration_language": "english", // Primary language
  "language_variants": ["spanish", "french"],
  // Uses the same visuals across all language variant videos
  "variant_visuals_mode": "shared"
}

per_language.json

{
  "prompt": "Explain photosynthesis",
  "narration_language": "english", // Primary language
  "language_variants": ["spanish", "french"],
  // Generate separate visuals for each language
  "variant_visuals_mode": "per_language"
}

Multi-Language Variants

language_variants

Submit one prompt; receive one video per language. The example below renders English + Spanish + French in shared visuals mode.

multi-language-variants.js

import fetch from "node-fetch";

const API_KEY = "api-key";
const BASE_URL = "{BASE_URL}";

// Generate the same video in English (primary) + Spanish + French.
// "shared" visuals mode reuses the primary's frames so variants render
// fast and cost less than full re-renders.
const payload = {
  prompt: "Explain how photosynthesis works in simple terms.",
  narration_language: "english",          // Primary narration language
  onscreen_text_language: "english",
  narration_voice: "female-1",
  golpo_video_engine: "golpo_canvas",
  canvas_style_variant: "playful",
  enable_color: true,
  timing: "4",

  // Extra languages (max depends on timing — 2 here at timing "4")
  language_variants: ["spanish", "french"],
  variant_visuals_mode: "shared", // "shared" or "per_language"
};

const response = await fetch(`${BASE_URL}/api/v2/videos/generate`, {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    "x-api-key": API_KEY,
  },
  body: JSON.stringify(payload),
});

const data = await response.json();
console.log("Primary job_id   :", data.job_id);
console.log("Variant video IDs:", data.variant_video_ids); // parallel to language_variants

// Poll the primary AND each variant independently.
for (const id of [data.job_id, ...data.variant_video_ids]) {
  const s = await fetch(`${BASE_URL}/api/v2/videos/status/${id}`, {
    headers: { "x-api-key": API_KEY },
  }).then(r => r.json());
  console.log(id, "→", s.status, s.video_url ?? "");
}

Run with: node --env-file=.env multi-language-variants.js

Generate Video

POST /api/v2/videos/generate

Generate a video from a prompt or script with customizable voice, visuals, music, and branding.

generate.js

import fetch from "node-fetch";

const API_KEY = "api-key";
const BASE_URL = "{BASE_URL}";

const payload = {
  prompt: "Explain the quarterly roadmap",
  background_track: "engaging"
};

const response = await fetch(`${BASE_URL}/api/v2/videos/generate`, {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    "x-api-key": API_KEY
  },
  body: JSON.stringify(payload)
});

console.log(await response.json());

Run with: node --env-file=.env generate.js

List Videos

GET /api/v2/videos

Retrieve videos belonging to the authenticated user. Supports limit and offset query params.

list-videos.js

import fetch from "node-fetch";

const BASE_URL = "{BASE_URL}";

// Query params:
//   limit  — max number of videos to return (optional)
//   offset — number of videos to skip (optional, for pagination)
const response = await fetch(`${BASE_URL}/api/v2/videos?limit=10&offset=0`, {
  headers: { "x-api-key": "api-key" }
});

const data = await response.json();
console.log(data.videos);
console.log(data.total);

Run with: node --env-file=.env list-videos.js

Get Video

GET /api/v2/videos/{video_id}

Retrieve metadata for a single video by ID.

get-video.js

const BASE_URL = "{BASE_URL}";
const VIDEO_ID = "video_id";

const response = await fetch(`${BASE_URL}/api/v2/videos/${VIDEO_ID}`, {
  headers: { "x-api-key": "api-key" }
});
console.log(await response.json());

Run with: node --env-file=.env get-video.js <video_id>

Update Video

PATCH /api/v2/videos/{video_id}

Update video metadata. Only 'title' and 'visibility' can be updated.

update-video.js

const BASE_URL = "{BASE_URL}";
const VIDEO_ID = "video_id";

// Only 'title' and 'visibility' are accepted.
const payload = { title: "Updated internal briefing", visibility: "private" };

const response = await fetch(`${BASE_URL}/api/v2/videos/${VIDEO_ID}`, {
  method: "PATCH",
  headers: {
    "Content-Type": "application/json",
    "x-api-key": "api-key"
  },
  body: JSON.stringify(payload)
});

console.log(await response.json());

Run with: node --env-file=.env update-video.js <video_id>

Delete Video

DELETE /api/v2/videos/{video_id}

Soft-delete a generated video.

delete-video.js

const BASE_URL = "{BASE_URL}";
const VIDEO_ID = "video_id";

const response = await fetch(`${BASE_URL}/api/v2/videos/${VIDEO_ID}`, {
  method: "DELETE",
  headers: { "x-api-key": "api-key" }
});

console.log(await response.json());

Run with: node --env-file=.env delete-video.js <video_id>

Job Status

GET /api/v2/videos/status/{job_id}

Poll the status endpoint to track generation progress. Response shape depends on the original /generate request — script_text, podcast_url, or video_url.

job-status.js

const BASE_URL = "{BASE_URL}";
const JOB_ID = "job-id";

const response = await fetch(`${BASE_URL}/api/v2/videos/status/${JOB_ID}`, {
  headers: { "x-api-key": "api-key" }
});

const body = await response.json();
console.log(body);
// Three response shapes (depending on original generate request):
//   { job_id, status, created_at, script_text }   // enable_script_only_mode=true
//   { job_id, status, created_at, podcast_url }   // enable_podcast_engine=true
//   { job_id, status, created_at, video_url }     // default (video)

Run with: node --env-file=.env job-status.js <job_id>

Golpo Canvas

Use with golpo_video_engine: "golpo_canvas".

Chalkboard White-on-Blackchalkboard_bw

Chalkboard Colorchalkboard_color

Whiteboardwhiteboard

Modern Minimalmodern_minimal

Technicaltechnical

Sharpiesharpie

Playfulplayful

Editorialeditorial

Illustrationsillustrations

Chalkboard Black-on-Whitechalkboard_black_on_white

Golpo Sketch

Use with golpo_video_engine: "golpo_sketch".

Classicclassic

Improved (Beta)improved(beta)

Formalformal

Crayoncrayon

Dry Erasedry_erase

Professional Cleanprofessional_clean

Creativecreative

Infographicsinfographics

Chalkboard Black-on-Whitechalkboard_black_on_white

Pen in Hand — a real hand draws it on screen

Golpo Canvas only. Add pen_animation_style to render a drawing hand — allowed values "pen", "marker", "stylus".

Pen Whiteboardpen

Marker Whiteboardmarker

Stylus Whiteboardstylus

Background Music

Set background_track to add a soundtrack. Tap any track to preview it.

NOTE:Frame editing is only supported for Golpo Sketch videos (golpo_video_engine: "golpo_sketch"). The source video must have finished rendering before you can edit or combine its frames. Like the rest of API v2 endpoints, every editing endpoint requires thex-api-key.

Editing workflow

frame-versionsedit-framesedits/{id}/statuscombine-framesfinal video

set-frame-version is optional — use it to revert a frame to an earlier take from its version history. Both edit-frames and combine-frames are async; poll their job IDs at the shared edits/{edit_job_id}/status endpoint.

List Frame Versions

GET/api/v2/videos/{video_id}/frame-versions

Return the per-frame animation history for a sketch video — the currently active MP4 for each frame plus every previous version. Use this to discover frame indices before editing or to find a URL to revert to. Requires x-api-key.

frame-versions.js

import fetch from "node-fetch";

const BASE_URL = "{BASE_URL}"; // You will receive this when creating an API key
const VIDEO_ID = "video_id";   // a completed Golpo Sketch video

const response = await fetch(
  `${BASE_URL}/api/v2/videos/${VIDEO_ID}/frame-versions`,
  { headers: { "x-api-key": "api-key" } }
);

const data = await response.json();
// {
//   video_id: "...",
//   frame_animations: { "0": "https://...0.mp4?v=...", "1": "..." },        // active per frame
//   frame_animation_versions: { "0": [{ url, ts, note }, ...] }             // full history per frame
// }
console.log(data.frame_animations);

Run with: node --env-file=.env frame-versions.js

Edit Frames

POST/api/v2/videos/{video_id}/edit-frames

Re-render one or more frames from natural-language instructions. frame_ids and edit_prompts must be the same length (one prompt per frame). Each frame returns its own pollable edit_job_id. Requires x-api-key.

edit-frames.js

import fetch from "node-fetch";

const BASE_URL = "{BASE_URL}";
const VIDEO_ID = "video_id";

// frame_ids[i] is edited using edit_prompts[i] — the two arrays must match in length.
// reference_images is optional style guidance (https/s3 URLs).
const payload = {
  frame_ids: ["0", "2"],
  edit_prompts: [
    "Make the title text blue and bolder",
    "Add an arrow pointing to the diagram"
  ],
  reference_images: ["https://example.com/style.png"] // optional
};

const response = await fetch(
  `${BASE_URL}/api/v2/videos/${VIDEO_ID}/edit-frames`,
  {
    method: "POST",
    headers: { "Content-Type": "application/json", "x-api-key": "api-key" },
    body: JSON.stringify(payload)
  }
);

const data = await response.json();
// {
//   video_id: "...", status: "processing", message: "Started 2 frame edit job(s)",
//   edit_jobs: [{ frame_id: "0", edit_job_id: "..." }, { frame_id: "2", edit_job_id: "..." }]
// }
// Poll each edit_job_id at GET /api/v2/videos/edits/{edit_job_id}/status.
console.log(data.edit_jobs);

Run with: node --env-file=.env edit-frames.js

Edit Job Status

GET/api/v2/videos/edits/{edit_job_id}/status

Poll the status of a frame edit or combine job. This endpoint supports both /edit-frames and /combine-frames jobs. Returns 'processing', a 'completed' result with the MP4 url, or 'failed' with an error message. Requires x-api-key.

edit-status.js

import fetch from "node-fetch";

const BASE_URL = "{BASE_URL}";
const EDIT_JOB_ID = "edit_job_id"; // from /edit-frames or /combine-frames

const response = await fetch(
  `${BASE_URL}/api/v2/videos/edits/${EDIT_JOB_ID}/status`,
  { headers: { "x-api-key": "api-key" } }
);

const data = await response.json();
// { status: "processing" }                       -> still rendering
// { status: "completed", url: "https://...mp4" } -> edited frame / combined video
// { status: "failed", error: "..." }
console.log(data.status, data.url ?? "");

Run with: node --env-file=.env edit-status.js

Combine Frames

POST/api/v2/videos/{video_id}/combine-frames

Concatenate per-frame MP4s into a fresh final video, muxing audio from the original render. You must pass exactly one URL per frame, in order (a count mismatch returns 422). Mix edited URLs (from /edit-frames) with the unchanged frame_animations URLs (from /frame-versions). Returns a combine_job_id to poll via the status endpoint. Requires x-api-key.

combine-frames.js

import fetch from "node-fetch";

const BASE_URL = "{BASE_URL}";
const VIDEO_ID = "video_id";

// One MP4 per frame, in order: edited URLs for frames you changed, and the
// existing frame_animations URLs (from /frame-versions) for the rest.
const payload = {
  frame_animation_urls: [
    "https://.../frame-0-edited.mp4",
    "https://.../frame-1.mp4",
    "https://.../frame-2-edited.mp4"
  ]
};

const response = await fetch(
  `${BASE_URL}/api/v2/videos/${VIDEO_ID}/combine-frames`,
  {
    method: "POST",
    headers: { "Content-Type": "application/json", "x-api-key": "api-key" },
    body: JSON.stringify(payload)
  }
);

const data = await response.json();
// { combine_job_id: "...", video_id: "...", status: "processing" }
// Poll GET /api/v2/videos/edits/{combine_job_id}/status for the final video URL.
console.log(data.combine_job_id);

Run with: node --env-file=.env combine-frames.js

Set Frame Version

POST/api/v2/videos/{video_id}/set-frame-version

Make a specific animation version the active one for a frame — e.g. revert to an earlier take. The frame_animation_url should come from frame_animation_versions[frame_id] returned by /frame-versions. The response returns the activated URL. Requires x-api-key.

set-frame-version.js

import fetch from "node-fetch";

const BASE_URL = "{BASE_URL}";
const VIDEO_ID = "video_id";

// v2 names this field "frame_animation_url" (v1 calls it "url"). It should be one
// of the versions listed in frame_animation_versions[frame_id] from /frame-versions.
const payload = {
  frame_id: "0",
  frame_animation_url: "https://.../frame-0-v2.mp4"
};

const response = await fetch(
  `${BASE_URL}/api/v2/videos/${VIDEO_ID}/set-frame-version`,
  {
    method: "POST",
    headers: { "Content-Type": "application/json", "x-api-key": "api-key" },
    body: JSON.stringify(payload)
  }
);

const data = await response.json();
// { video_id: "...", frame_id: "0", frame_animation_url: "https://.../frame-0-v2.mp4?v=..." }
console.log(data.frame_animation_url);

Run with: node --env-file=.env set-frame-version.js

Value	Duration
"0.25"	15 seconds
"0.5"	30 seconds
"1"	1 minute
"2"	2 minutes
"4"	4 minutes
"8"	8 minutesBeta
"10"	10 minutesBeta
"15"	15 minutesBeta
"auto"	System-determined

Supported values: "0.25", "0.5", "1", "2", "4", "8", "10", "15", "auto".

NOTE:"15" is restricted: 15-minute videos are only available on the Enterprise Custom plan. Other plans receive a 403 when requesting 15-minute videos. Contact support@golpoai.com to upgrade.

When to use "auto" timing: Use auto timing when the content complexity should determine the video length. Especially useful for problem-solving — pass a math or physics problem via reference_images and the golpo engine will analyze the complexity and generate a script of the appropriate length.

Example: Auto Timing with Problem Solving

Pass a problem image via reference_images with Golpo Canvas and let the engine decide how long the explanation should be.

auto-timing-example.json

{
  "prompt": "Solve and explain this calculus problem step by step",
  "reference_images": [
    "https://your-bucket.s3.amazonaws.com/calculus-problem.png"
  ],
  "golpo_video_engine": "golpo_canvas",
  "canvas_style_variant": "whiteboard",
  "timing": "auto",
  "narration_voice": "male-1",
  "video_orientation": "horizontal"
}

Value	Format	Dimensions	Use Cases
"vertical"	Vertical / Portrait	1024×1536 px	TikTok, Instagram Reels, YouTube Shorts
"horizontal"	Horizontal / Landscape (Default)	1536×1024 px	YouTube, standard video content

9:16

vertical

1024×1536 px

16:9

horizontal (default)

1536×1024 px

Value	Voice Type	Description
"female-1"	Female 1	Female narrator voice (Default)
"female-2"	Female 2	Alternative female narrator voice
"male-1"	Male 1	Male narrator voice
"male-2"	Male 2	Alternative male narrator voice

background_track key	Mood / Usage
"jazz"	Warm, neutral bed
"lofi"	Calm, study vibes
"dramatic"	Cinematic tension
"engaging"	Subtle corporate pulse
"hyper"	High-energy electronic
"inspirational"	Uplifting orchestral
"documentary"	Serious factual tone

Pass instructions in the narration_instructions field to guide voice generation. The AI will adjust accent, tone, pace, and delivery accordingly.

narration-instructions-example.json

{
  "prompt": "Explain quantum computing basics",
  "narration_instructions": "Speak in a warm British accent with a calm, professorial tone. Pause slightly between key concepts for emphasis.",
  "narration_voice": "male-1"
}

More examples

•“Talk in a French accent”

•“Use an enthusiastic, energetic tone”

•“Talk like a professor — measured and articulate”

•“Speak slowly and clearly, like a meditation guide”

•“Fast-paced sports commentator style”

•“Friendly and casual, like chatting with a friend”

Pass instructions in the visual_instructions field to guide the visual generation. The AI will adjust scene composition, imagery, and visual style accordingly.

visual-instructions-example.json

{
  "prompt": "Company quarterly results overview",
  "visual_instructions": "Use clean corporate visuals with data charts and graphs. Include stock footage of modern office environments. Prefer blue and white color palette.",
  "narration_voice": "female-1"
}

More examples

•“Show more graphs and charts”

•“Include more stock footage”

•“Use cinematic transitions”

•“Female presenter in an office setting”

•“Urban background with modern architecture”

•“Dark moody aesthetic with neon accents”

Value	Engine	Description
"golpo_canvas"	Golpo Canvas (Default)	Canvas-based video with rich style variants. Requires `canvas_style_variant`.
"golpo_sketch"	Golpo Sketch	Whiteboard line-art animation. Requires `sketch_style_variant`.

NOTE:Setting canvas_style_variant to null with golpo_canvas(or sketch_style_variant to null with golpo_sketch) is rejected with HTTP 422.

Value	Label	Description
"chalkboard_bw"	Chalkboard (White on Black)	White chalk on a black background
"chalkboard_black_on_white"	Chalkboard (Black on White)	Black ink on a white background
"chalkboard_color"	Chalkboard Color	Colorful neon chalkboard style (Default)
"whiteboard"	Whiteboard	Clean whiteboard illustrations
"modern_minimal"	Modern Minimal	Sleek, minimal modern aesthetic
"technical"	Technical	Technical diagram style
"sharpie"	Sharpie	Bold marker/sharpie drawn style
"playful"	Playful	Fun, colorful playful illustrations
"editorial"	Editorial	Magazine/editorial illustration style
"illustrations"	Illustrations	Vox-style editorial explainer with paper-textured infographic look

golpo-canvas-example.json

{
  "prompt": "How solar panels convert sunlight to electricity",
  "golpo_video_engine": "golpo_canvas",
  "canvas_style_variant": "modern_minimal",
  "pen_animation_style": "stylus",
  "scene_pacing": "normal",
  "timing": "2"
}

Value	Label	Description
"classic"	Classic	Original Golpo Sketch — classic whiteboard line-art animation (Default)
"improved(beta)"	Improved	Improved line-art with cleaner strokes and a more polished look
"formal"	Formal	Advanced sketch generation with higher detail and refined aesthetics
"crayon"	Crayon	Storytelling crayon sketch look
"dry_erase"	Dry Erase	Whiteboard dry-erase aesthetic
"professional_clean"	Professional Clean	Clean, minimal professional sketch
"creative"	Creative	After Skool whiteboard-marker variant with bold colorful strokes
"infographics"	Infographics	Vox-style editorial explainer with paper-textured infographic look
"chalkboard_black_on_white"	Chalkboard (Black on White)	Black ink on a white background

scene_pacing applies to both engines: "normal" caps frames at 15s; "fast" caps frames at 10s.

Value	Description
null	No pen cursor (Default)
"stylus"	Thin stylus pen cursor
"marker"	Thick marker cursor
"pen"	Classic pen cursor

When you supply your own script via custom_script and also upload custom_images, wrap a sentence in paired markers to control where each image appears. [IMAGE 1] refers to the first image in custom_images, [IMAGE 2] to the second, and so on (marker numbers start at 1, not 0). The image displays for the duration of the spoken text between [START][IMAGE N] and [END][IMAGE N].

Follow these rules to make sure your image inserts are never silently skipped:

Use 1 for your first uploaded image, 2 for the second, and so on. [IMAGE 1] → first item in custom_images. If you write [IMAGE 3] but only uploaded 2 images, that image won’t appear.
Every [START][IMAGE N] needs a matching [END][IMAGE N] with the same number. Missing or mismatched closing tags drop the marker.
Put whitespace around marker tags when they sit next to sentence punctuation. Always leave a space (or newline) between the preceding sentence's period and [START], and between [END][IMAGE N] and the next sentence. Write story. [START][IMAGE 2] and [END][IMAGE 1] Next sentence. — never story.[START][IMAGE 2] or [END][IMAGE 1]Next sentence.. Without these spaces the sentence splitter cannot find a clean break and the marker is dropped.
Wrap a complete, simple sentence of 8–15 plain words between the markers. Shorter than 5 words is fragile; longer than 20 holds the image on-screen too long.
Avoid colons (:), em-dashes (—), semicolons (;), and parentheses inside the marker span.
Keep commas inside the marker to at most one or two. Stacked lists like "not X, not Y, not Z" confuse the matcher.

NOTE:Image markers that violate any of these rules are silently dropped — no error, no warning. The frame at that position falls back to an AI-generated visual and your uploaded image never appears.

Example 1 — Good shape (use as a template):

image-markers-good-shape.json

{
  "custom_script": "Travel changes how we see the world. [START][IMAGE 1] This is a quiet beach at sunset with golden waves rolling onto soft sand. [END][IMAGE 1] Coastal trips help us slow down and breathe deeply for a few days. [START][IMAGE 2] This is a mountain village tucked between tall green peaks and clouds. [END][IMAGE 2] Highland trips reward you with cool air and stunning scenic views.",
  "custom_images": [
    "https://example.com/beach.png",
    "https://example.com/mountain.png"
  ],
  "custom_images_description": [
    "A quiet beach at sunset",
    "A mountain village among green peaks"
  ]
}

Each wrapped sentence is ~10–12 plain words, contains no colons or em-dashes, has a single space after each preceding period, and the marker numbers (1, 2) match the 2-item custom_images array.

You can also use other image control parameters alongside the markers: keep_original_images (boolean array — skip AI processing and place the image untouched) and disable_image_animation (boolean array — render as a static frame instead of an animated drawing).

Example 2 — Common mistakes and their fixes:

image-markers-missing-space.json

// ❌ Mistake A: no space BEFORE [START] — IMAGE 2 won't appear
{
  "custom_script": "...each region tells a unique story.[START][IMAGE 2] Rajasthan glows proudly with royal forts and deserts[END][IMAGE 2]."
}

// ✅ Fix A: add a single space between "story." and "[START][IMAGE 2]"
{
  "custom_script": "...each region tells a unique story. [START][IMAGE 2] Rajasthan glows proudly with royal forts and deserts[END][IMAGE 2]."
}

// ❌ Mistake B: no space AFTER [END] — IMAGE 1 won't appear
{
  "custom_script": "[START][IMAGE 1] A coastal town glows at sunset by the harbor. [END][IMAGE 1]The next morning brings calm waters and a quiet beach."
}

// ✅ Fix B: add a single space between "[END][IMAGE 1]" and "The"
{
  "custom_script": "[START][IMAGE 1] A coastal town glows at sunset by the harbor. [END][IMAGE 1] The next morning brings calm waters and a quiet beach."
}

When you supply your own script via custom_script and also upload custom_videos, wrap the narration that should play over video N with a paired [START][VIDEO N] ... [END][VIDEO N] block — the same shape used for image markers. [VIDEO 1] refers to the first video in custom_videos, [VIDEO 2] to the second, and so on (marker numbers start at 1, not 0). The narration text between the [START] and [END] markers determines the on-screen duration of the user video — the engine cuts to your clip at the START marker and returns to AI-generated visuals at the END marker.

Follow these rules to make sure your video inserts are never silently skipped:

Use 1 for your first uploaded video, 2 for the second, and so on. [VIDEO 1] → first item in custom_videos. If you write [VIDEO 3] but only uploaded 2 videos, that video won’t appear.
Every video needs a matching [START] and [END] pair. Use the shape [START][VIDEO N] ... narration ... [END][VIDEO N]. Writing a bare [VIDEO N] without the wrappers causes the engine to start the video but never stop it — it will keep playing over all subsequent narration and block any later [VIDEO N] blocks from rendering.
Spelling and casing must be exact. Use uppercase VIDEO with a single space before the number — typos like [VEIDO 1] or [VDIEO 1] are silently dropped.
Always put whitespace around each marker. Write Welcome. [START][VIDEO 1] Here is the footage. [END][VIDEO 1] Up next. — never glue markers to surrounding words.
Place markers at sentence boundaries, not mid-sentence. Put [START][VIDEO N] right before the first word of the sentence the video should cover, and [END][VIDEO N] right after the closing punctuation of the last sentence in that block.
Match the narration length to the user video’s original duration. The engine treats the uploaded clip’s natural length as authoritative — [END][VIDEO N] does not force a hard cut. If the clip is longer than the narration wrapped inside the block, the video keeps playing past [END] and the next narration line is delayed until the clip finishes, producing a visible silent gap. Either pre-trim the clip before uploading, or expand the narration inside the block to cover the clip’s full runtime.
Each [VIDEO N] appears exactly once. One [START] and one [END] per video — do not reuse the same N later in the script.
Do not nest video blocks. [START][VIDEO 1] ... [START][VIDEO 2] ... [END][VIDEO 1] [END][VIDEO 2] is invalid; each block must fully close before the next one opens.

NOTE:Video markers that violate any of these rules are silently dropped — no error, no warning. The position falls back to the engine's normal narration without your uploaded video.

NOTE:Heads up — clip duration vs. narration duration. The user video plays for its full uploaded length regardless of how much narration you wrap between [START][VIDEO N] and [END][VIDEO N]. If the clip is longer than the wrapped narration, the video keeps playing past [END] silently, and the next narration line is held back until the clip ends. To avoid this silent gap, either trim the clip to roughly the spoken duration of the wrapped sentences before uploading, or write enough narration inside the block to cover the clip’s full runtime.

Example — Good shape (use as a template):

video-markers-good-shape.json

{
  "custom_script": "Welcome to today's nature tour. [START][VIDEO 1] Here is a stunning view of the mountain ridge captured from the air. [END][VIDEO 1] Now let us head down to the coast. [START][VIDEO 2] These calming waves close out our journey. [END][VIDEO 2] Thanks for watching.",
  "custom_videos": [
    "https://example.com/mountain-clip.mp4",
    "https://example.com/coastal-clip.mp4"
  ],
  "custom_videos_description": [
    "Aerial view of a mountain ridge",
    "Coastal waves rolling onto a beach"
  ]
}

Each [VIDEO N] appears twice — once with [START] at the beginning of its sentence and once with [END] at the end. Video 1 plays only over “Here is a stunning view… from the air.”, then the engine returns to AI-generated visuals for “Now let us head down to the coast.”, then video 2 plays only over “These calming waves close out our journey.” Marker numbers (1, 2) match the 2-item custom_videos array. To control whether the video plays with its original audio or AI narration, see use_ai_audio_at.

Language	Accepted values	Note
English	english or en	Default when omitted
Hindi	hindi or hi	—
Spanish	spanish or es	—
French	french or fr	—
German	german or de	—
Italian	italian or it	—
Portuguese	portuguese or pt	—
Russian	russian or ru	—
Japanese	japanese or ja	—
Korean	korean or ko	—
Chinese / Mandarin	chinese, mandarin, or zh	Both map to zh
Arabic	arabic or ar	—
Dutch	dutch or nl	—
Polish	polish or pl	—
Turkish	turkish or tr	—
Swedish	swedish or sv	—
Danish	danish or da	—
Norwegian	norwegian or no	—
Finnish	finnish or fi	—
Greek	greek or el	—
Czech	czech or cs	—
Hungarian	hungarian or hu	—
Romanian	romanian or ro	—
Thai	thai or th	—
Vietnamese	vietnamese or vi	—
Indonesian	indonesian or id	—
Malay	malay or ms	—
Tamil	tamil or ta	—
Telugu	telugu or te	—
Bengali	bengali or bn	—
Marathi	marathi or mr	—
Gujarati	gujarati or gu	—
Kannada	kannada or kn	—
Malayalam	malayalam or ml	—
Punjabi	punjabi or pa	—
Urdu	urdu or ur	—

Code	Meaning	Recovery
200	Successful request (GET, POST, PATCH, or DELETE)	Consume response payload
400	Bad request (invalid fields or file types)	Check request body and field values
401	Missing or invalid authentication	Verify the x-api-key header is set and valid
403	Plan does not allow requested action	Upgrade plan or contact support
404	Video / job not found	Verify identifiers belong to the account
413	Uploaded file is 10 MB or larger	Compress or split the file before retrying
422	Validation failure (bad payload)	Inspect detail field in response
429	Rate limit exceeded	Back off and retry with exponential delay
500	Unexpected server error	Retry later; contact support if persistent

String detail

Most errors return detail as a single human-readable string describing what went wrong. Surface it directly in your UI or logs.

error-response-string.json

{
  "detail": "'canvas_style_variant' is required when using 'golpo_canvas' engine."
}

Array detail

When the request body fails validation, detail is an array of error objects — one per failing field. Each object includes the failing field path (loc) and a human-readable message (msg).

error-response-array.json

{
  "detail": [
    {
      "type": "value_error",
      "loc": ["body"],
      "msg": "Value error, 'custom_videos_description' is required when 'custom_videos' is provided. Provide one description per video (2 expected)."
    }
  ]
}

Usage-Based Plan

API Only Tier

Usage-based pricing with volume discounts

About this plan

This plan is for using Golpo within your program or application. You will not be able to use the Golpo platform to generate videos manually.

This is a usage-based plan with a minimum cost of $200 to enter. Perfect for developers and businesses who need programmatic access to video generation.

Pay only for what you use — volume-based pricing that gets better as you scale.

Platform access (manual video creation via the dashboard) is not included in this plan. If you need both platform + API access, please consider buying an enterprise plan or business plan with an API add-on.

Pricing Rates

Credit Conversion

USD

1 Credit

Golpo credit

Resource	Cost
1 min videoVideo generation	2 Credits= $2.00
Volume discounts apply at higher usage tiers. Contact us for enterprise rates.

Minimum entry: $200 (200 credits = ~100 minutes of video)

Quick Start

Endpoints

Request Body

Parameter Groups

Upload File

User Assets

Local Paths

Language Quickstarts

Display Language

Multi-Language Variants

Style Previews

Sketch Video Editing

Timing

Video Orientation

Narration Voices

Background Music

Voice Instructions

Video Instructions

Golpo Engine

Canvas Style Variant

Sketch Style Variant

Pen Animation Style

Inserting Images in Script

Inserting Videos in Script

Language Support

HTTP Status Codes

Error Response Format

API Only Tier

About this plan

Pricing Rates