Agent Skills 排行榜 · 关键词 + 语义搜索

/ 聚焦搜索框
正在使用 AI 进行语义搜索...
21,657
总 Skills
38.4M
总安装量
2,456
贡献者
# Skill 仓库 描述 安装量
10351 better-all casper-studios/casper-marketplace
Documentation GitHub Repository: https://github.com/shuding/better-all Note: This library is not yet indexed in DeepWiki or Context7. better-all Library better-all provides Promise.all with automatic dependency optimization. Instead of manually analyzing which tasks can run in parallel, tasks declare dependencies inline and execution is automatically optimized. Installation pnpm add better-all Basic Usage import { all } from "better-all" ; const results = await all ( { // Independent tasks run i...
126
10352 asr answerzhao/agent-skills
When to Use User wants to transcribe an audio file to text User provides an audio file path and asks for transcription User says "转录", "识别", "transcribe", "语音转文字" When NOT to Use User wants to synthesize speech from text (use /tts ) User wants to create a podcast or explainer (use /podcast or /explainer ) Purpose Transcribe audio files to text using coli asr , which runs fully offline via local speech recognition models. No API key required. Supports Chinese, English, Japanese, Korean, and Canto...
126
10353 convex best practices waynesutton/convexskills
Convex Best Practices Build production-ready Convex applications by following established patterns for function organization, query optimization, validation, TypeScript usage, and error handling. Code Quality All patterns in this skill comply with @convex-dev/eslint-plugin . Install it for build-time validation: npm i @convex-dev/eslint-plugin --save-dev // eslint.config.js import { defineConfig } from "eslint/config" ; import convexPlugin from "@convex-dev/eslint-plugin" ; export default define...
126
10354 fba fastapi-practices/skills
官方文档: [https://fastapi-practices.github.io/fastapi_best_architecture_docs/](https://fastapi-practices.github.io/fastapi_best_architecture_docs/) 项目结构 ``` backend/ ├── main.py 应用入口 ├── run.py 启动脚本(IDE 调试) ├── cli.py CLI 命令行工具 ├── app/ 核心业务模块 │ ├── router.py 主路由汇总 │ ├── admin/ 管理后台应用 │ │ ├── api/v1/ API 路由 │ │ │ ├── auth/ 认证模块 │ │ │ ...
126
10355 typescript-expert martinholovsky/claude-skills-generator
TypeScript Expert You are an advanced TypeScript expert with deep, practical knowledge of type-level programming, performance optimization, and real-world problem solving based on current best practices. When invoked: If the issue requires ultra-specific expertise, recommend switching and stop: Deep webpack/vite/rollup bundler internals → typescript-build-expert Complex ESM/CJS migration or circular dependency analysis → typescript-module-expert Type performance profiling or compiler interna...
126
10356 axiom-foundation-models-diag charleswiltgen/axiom
Foundation Models Diagnostics Overview Foundation Models issues manifest as context window exceeded errors, guardrail violations, slow generation, availability failures, and unexpected output. Core principle 80% of Foundation Models problems stem from misunderstanding model capabilities (3B parameter device-scale model, not world knowledge), context limits (4096 tokens), or availability requirements—not framework bugs. Red Flags — Suspect Foundation Models Issue If you see ANY of these, suspe...
125
10357 oauth2-authentication manutej/luxor-claude-marketplace
OAuth2 Authentication A comprehensive skill for implementing secure authentication and authorization using OAuth2 and OpenID Connect. This skill covers all major authorization flows, token management strategies, security best practices, and real-world implementation patterns for web, mobile, and API applications. When to Use This Skill Use this skill when: Implementing user authentication in web applications, SPAs, or mobile apps Building API authorization with access tokens and refresh toke...
125
10358 cmo-advisor borghei/claude-skills
CMO Advisor Marketing leadership advisory for Chief Marketing Officers. Core Competencies Brand strategy and positioning Demand generation Marketing operations Growth marketing Revenue marketing alignment Content strategy Marketing technology Team building and leadership Marketing Strategy Framework Strategic Planning Process Phase 1: Situation Analysis Market size and dynamics Competitive landscape Customer segmentation Current performance audit Phase 2: Strategy Definition Target market ...
125
10359 slack-automation casper-studios/casper-marketplace
Slack Automation Overview Interact with Slack workspaces for channel management, message reading, and content analysis. All operations are read-only unless creating channels. Quick Decision Tree What do you need? │ ├── Search for channels by name │ └── references/search.md │ └── Script: scripts/slack_search.py search │ ├── Read messages from a channel │ └── references/fetch-news.md │ └── Script: scripts/fetch_slack_news.py │ ├── Create a new channel │ └── references/create-channel.md │...
125
10360 casper-design-system casper-studios/casper-marketplace
Casper Studios Design System A clean, elevated SaaS design system built on shadcn/ui , Tailwind CSS v4 , and React (Vite) . Every interface generated for Casper Studios — whether a client demo, internal tool, or quick prototype — must follow these rules to maintain a consistent, professional visual identity across the team. Before generating any UI code, read this file completely and the reference files listed below. You MUST read the reference files — they contain rules and code examples that a...
125
10361 agent-ready-codebase casper-studios/casper-marketplace
Agent-Ready Codebase Overview When agents struggle with a codebase, they are reflecting and amplifying the codebase's existing weaknesses. This skill evaluates codebases against five principles that determine agent effectiveness, and provides concrete guidance to improve each one. It adapts to the project's language and stack. Based on "AI Is Forcing Us To Write Good Code" . Mode Selection Determine which mode to operate in based on context: Audit : The user has an existing codebase and wants to...
125
10362 agent-browser casper-studios/casper-marketplace
Browser Automation with agent-browser The CLI uses Chrome/Chromium via CDP directly. Install via npm i -g agent-browser , brew install agent-browser , or cargo install agent-browser . Run agent-browser install to download Chrome. Core Workflow Every browser automation follows this pattern: Navigate : agent-browser open <url> Snapshot : agent-browser snapshot -i (get element refs like @e1 , @e2 ) Interact : Use refs to click, fill, select Re-snapshot : After navigation or DOM changes, get fresh r...
125
10363 mini-wiki trsoliu/mini-wiki
Wiki Generator Generate structured project Wiki to .mini-wiki/ directory. Output Structure .mini-wiki/ ├── config.yaml Configuration ├── meta.json Metadata ├── cache/ Incremental update cache ├── wiki/ Main Wiki content │ ├── index.md │ ├── architecture.md │ ├── getting-started.md │ ├── modules/ │ ├── api/ │ └── assets/ └── i18n/ Multi-language versions ├── en/ └── zh/ 🔌 Plugin Executi...
125
10364 review buiducnhat/agent-skills
Review Playwright Tests Systematically review Playwright test files for anti-patterns, missed best practices, and coverage gaps. Input $ARGUMENTS can be: A file path: review that specific test file A directory: review all test files in the directory Empty: review all tests in the project's testDir Steps 1. Gather Context Read playwright.config.ts for project settings List all *.spec.ts / *.spec.js files in scope If reviewing a single file, also check related page objects and fixtures 2. Check Ea...
125
10365 ai-video-production-master erichowens/some_claude_skills
AI Video Production Master Expert in script-to-video production pipelines for Apple Silicon Macs. Specializes in: Multiple video approaches: Stock footage, T2V (Sora-style), I2V, hybrid Hybrid local/cloud workflows for cost optimization Style and character consistency (LoRA, IPAdapter, prompt discipline) Motion graphics and synthetic elements (title cards, data viz, lower thirds) Artist commissioning for training datasets Cloud GPU orchestration (Vast.ai, RunPod) When to Use ✅ USE this skill ...
125
10366 arxiv-viewer actionbook/actionbook
arXiv Viewer Access, search, download, and read academic papers from arXiv using a hybrid API + Actionbook approach. Architecture Overview ┌─────────────────────────────────────────────────────────────┐ │ arxiv-viewer │ ├─────────────────┬─────────────────┬─────────────────────────┤ │ arXiv API │ arxiv.org Web │ ar5iv.org │ │ (WebFetch) │ (Actionbook) │ (Actionbook) │ ├─────────────────┼─────────────────┼────...
124
10367 dagster-integrations dagster-io/skills
Dagster Integrations Skill This skill is a thin wrapper around more complex and detailed reference documents. It helps guide users through workflows that require using or understanding Dagster integration libraries. Workflow Decision Tree Depending on the user's request, choose the appropriate reference file: Using a specific integration library? Try to find a references/dagster-<technology>/ folder in this directory. This will be named directly after the integration library name (e.g. dagster-d...
124
10368 learn boshu2/agentops
Learn: From Raw Materials to Published Output Prefix your first line with 🥷 inline, not as its own paragraph. Your role: collect, organize, translate, explain, structure. You support the user's thinking; you do not replace it. Pre-check Check whether /read and /write skills are installed (look for their SKILL.md in the skills directories). Warn if missing, do not block: /read missing -- Phase 1 fetch falls back to native WebFetch / curl ; coverage on paywalled, JS-heavy, and Chinese-platform pag...
124
10369 skill-creator casper-studios/casper-marketplace
Skill Creator A skill for creating new skills and iteratively improving them. At a high level, the process of creating a skill goes like this: Decide what you want the skill to do and roughly how it should do it Write a draft of the skill Create a few test prompts and run claude-with-access-to-the-skill on them Help the user evaluate the results both qualitatively and quantitatively While the runs happen in the background, draft some quantitative evals if there aren't any (if there are some, you...
124
10370 kiro-skill feiskyer/claude-code-settings
Kiro: Spec-Driven Development Workflow An interactive workflow that transforms ideas into comprehensive feature specifications, design documents, and actionable implementation plans. Quick Start When you mention creating a feature spec, design document, or implementation plan, this skill helps guide you through: Requirements → Define what needs to be built (EARS format with user stories) Design → Determine how to build it (architecture, components, data models) Tasks → Create actionable impl...
124
10371 speech-to-text martinholovsky/claude-skills-generator
ElevenLabs Speech-to-Text Transcribe audio to text with Scribe v2 - supports 90+ languages, speaker diarization, and word-level timestamps. Setup: See Installation Guide . For JavaScript, use @elevenlabs/* packages only. Quick Start Python from elevenlabs import ElevenLabs client = ElevenLabs ( ) with open ( "audio.mp3" , "rb" ) as audio_file : result = client . speech_to_text . convert ( file = audio_file , model_id = "scribe_v2" ) print ( result . text ) JavaScript import { ElevenLabsClient } ...
124
10372 clean-code vudovn/antigravity-kit
Clean Code Skill This skill embodies the principles of "Clean Code" by Robert C. Martin (Uncle Bob). Use it to transform "code that works" into "code that is clean." 🧠 Core Philosophy "Code is clean if it can be read, and enhanced by a developer other than its original author." — Grady Booch When to Use Use this skill when: Writing new code : To ensure high quality from the start. Reviewing Pull Requests : To provide constructive, principle-based feedback. Refactoring legacy code : To identify a...
124
10373 natural-dialogue-techniques wordflowlab/novel-writer-skills
自然对话写作技巧 核心原则 通过语言展现角色 每个角色都应该有独特的声音: 用词选择 受过教育的 vs 随意的 正式的 vs 俚语 技术术语 vs 日常语言 角色背景决定词汇 句子结构 短/断断续续 vs 长/流畅 完整句子 vs 片段 简单结构 vs 复杂从句 反映思维方式 言语模式 打断、停顿、重复 口头禅和填充词 独特的措辞习惯 文化和地域特色 不说什么 回避的话题 说谎的模式 沉默的时刻 未说出口的含义 潜台词胜过直白 角色的意思 vs 说的话: ❌ 直白对话(说教式) "我对你生气,因为你昨晚对我撒谎了关于你去哪里。" ✅ 富含潜台词的对话 "你的商务会议开得怎么样?" "很好。" "我确定是这样。" 为什么更好: 让读者参与推理 创造张力和不适 更真实(人们避免直接冲突) 展示角色动态 潜台词的层次 表面层:字面意义 情感层:真实感受 关系层:权力和亲密度 主题层:更大的故事意义 打断和重叠 真实对话不是有序的: 使用打断 "听着,我知道你认为——" "你不知道我在想什么。" "——但如果你让我说完——" 砰的一声,远处的门关上了。 "...
124
10374 fix buiducnhat/agent-skills
Fix Lint and Formatting Instructions Run yarn prettier to fix formatting Run yarn linc to check for remaining lint issues Report any remaining manual fixes needed Common Mistakes Running prettier on wrong files - yarn prettier only formats changed files Ignoring linc errors - These will fail CI, fix them before committing
124
10375 bootstrap buiducnhat/agent-skills
Bootstrap Development You are an expert in Bootstrap for building responsive, maintainable web interfaces. Core Principles Write clear, concise, and technical responses with precise Bootstrap examples Utilize Bootstrap's components and utilities for responsive, maintainable development Prioritize clean coding practices and descriptive class naming Minimize custom CSS by leveraging built-in components Grid System & Layout Leverage Bootstrap's grid system for responsive layouts Use container, ro...
124
10376 gmail odyssey4me/agent-skills
Gmail Gmail is a free email service provided by Google. It's widely used by individuals and businesses for sending, receiving, and organizing emails. Official docs: https://developers.google.com/gmail/api Gmail Overview Email Attachment Draft Label Thread Working with Gmail This skill uses the Membrane CLI to interact with Gmail. Membrane handles authentication and credentials refresh automatically — so you can focus on the integration logic rather than auth plumbing. Install the CLI Install the...
124
10377 axiom-hig charleswiltgen/axiom
Apple Human Interface Guidelines — Quick Reference When to Use This Skill Use when: Making visual design decisions (colors, backgrounds, typography) Reviewing UI for HIG compliance Answering "Should I use a dark background?" Choosing between design options Defending design decisions to stakeholders Quick lookups for common design questions Related Skills Use axiom-hig-ref for comprehensive details and code examples Use axiom-liquid-glass for iOS 26 material design implementation Use axiom-liqu...
124
10378 axiom-background-processing charleswiltgen/axiom
Background Processing Overview Background execution is a privilege, not a right. iOS actively limits background work to protect battery life and user experience. Core principle: Treat background tasks as discretionary jobs — you request a time window, the system decides when (or if) to run your code. Key insight: Most "my task never runs" issues stem from registration mistakes or misunderstanding the 7 scheduling factors that govern execution. This skill provides systematic debugging, not gues...
124
10379 pydantic-ai-common-pitfalls existential-birds/beagle
PydanticAI Common Pitfalls and Debugging Tool Decorator Errors Wrong: RunContext in tool_plain ERROR: RunContext not allowed in tool_plain @agent.tool_plain async def bad_tool(ctx: RunContext[MyDeps]) -> str: return "oops" UserError: RunContext annotations can only be used with tools that take context Fix: Use @agent.tool if you need context: @agent.tool async def good_tool(ctx: RunContext[MyDeps]) -> str: return "works" Wrong: Missing RunContext in tool ERROR: First param must be...
124
10380 nansen-wallet nansen-ai/nansen-cli
Wallet Auth Setup Save API key (non-interactive) nansen login --api-key < key > Or via env var: NANSEN_API_KEY = < key > nansen login Verify nansen research profiler labels --address 0xd8dA6BF26964aF9D7eEd9e03E53415D37aA96045 --chain ethereum Wallet Creation (Two-Step Agent Flow) Wallet creation requires a password from the human user . The agent must NOT generate or store the password itself. Step 1 (Agent → Human): Ask the user to provide a wallet password (minimum 12 characters). Step 2 (A...
124
10381 create-plan openai/skills
Create Plan Goal Turn a user prompt into a single, actionable plan delivered in the final assistant message. Minimal workflow Throughout the entire workflow, operate in read-only mode. Do not write or update files. Scan context quickly Read README.md and any obvious docs (docs/, CONTRIBUTING.md, ARCHITECTURE.md). Skim relevant files (the ones most likely touched). Identify constraints (language, frameworks, CI/test commands, deployment shape). Ask follow-ups only if blocking Ask at most 1...
124
10382 skill-creator skillcreatorai/ai-agent-skills
Skill Creator A skill for creating new skills and iteratively improving them. At a high level, the process of creating a skill goes like this: Decide what you want the skill to do and roughly how it should do it Write a draft of the skill Create a few test prompts and run claude-with-access-to-the-skill on them Help the user evaluate the results both qualitatively and quantitatively While the runs happen in the background, draft some quantitative evals if there aren't any (if there are some, you...
124
10383 react-patterns casper-studios/casper-marketplace
React Development Patterns Overview Expert guide for building modern React 19 applications with new concurrent features, Server Components, Actions, and advanced patterns. This skill covers everything from basic hooks to advanced server-side rendering and React Compiler optimization. When to Use Building React 19 components with TypeScript/JavaScript Managing component state with useState and useReducer Handling side effects with useEffect Optimizing performance with useMemo and useCallback Crea...
124
10384 skill creator hoangnguyen0403/agent-skills-standard
Agent Skill Creator Standard Priority: P0 (CRITICAL) Strict guidelines for High-Density Agent Skills. Maximize info/token ratio. Core Principles (Token Economy First ⚡) Progressive Loading : Load only essential content initially. Lazy References : Move detailed examples to references/ . Imperative Compression : Use verbs, abbreviations, bullet points. Context Limits : Cursor(~100k), Claude(~200k), Windsurf(~32k). Three-Level Loading System Metadata : Triggers → AGENTS.md index (Proactive Activat...
123
10385 canghe-xhs-images freestylefly/canghe-skills
Xiaohongshu Infographic Series Generator Break down complex content into eye-catching infographic series for Xiaohongshu with multiple style options. Usage Auto-select style and layout based on content /canghe-xhs-images posts/ai-future/article.md Specify style /canghe-xhs-images posts/ai-future/article.md --style notion Specify layout /canghe-xhs-images posts/ai-future/article.md --layout dense Combine style and layout /canghe-xhs-images posts/ai-future/article.md --style notion --layout li...
123
10386 find-skills brianlovin/claude-config
Find Skills This skill helps you discover and install skills from the open agent skills ecosystem. When to Use This Skill Use this skill when the user: Asks "how do I do X" where X might be a common task with an existing skill Says "find a skill for X" or "is there a skill for X" Asks "can you do X" where X is a specialized capability Expresses interest in extending agent capabilities Wants to search for tools, templates, or workflows Mentions they wish they had help with a specific domain (desi...
123
10387 axiom-ios-ai charleswiltgen/axiom
iOS Apple Intelligence Router You MUST use this skill for ANY Apple Intelligence or Foundation Models work. When to Use Use this router when: Implementing Apple Intelligence features Using Foundation Models Working with LanguageModelSession Generating structured output with @Generable Debugging AI generation issues iOS 26 on-device AI Routing Logic Foundation Models Work Implementation patterns → /skill axiom-foundation-models LanguageModelSession basics @Generable structured output Tool p...
123
10388 visual-explainer nicobailon/visual-explainer
Visual Explainer Generate self-contained HTML files for technical diagrams, visualizations, and data tables. Always open the result in the browser. Never fall back to ASCII art when this skill is loaded. Proactive table rendering. When you're about to present tabular data as an ASCII box-drawing table in the terminal (comparisons, audits, feature matrices, status reports, any structured rows/columns), generate an HTML page instead. The threshold: if the table has 4+ rows or 3+ columns, it belong...
123
10389 supabase-pentest yoanbernabeu/supabase-pentest-skills
Supabase Security Audit Orchestrator 🔵 RECOMMENDED: USE PLAN MODE FOR COMPLEX AUDITS When your environment supports Plan Mode , it is strongly recommended to activate it before starting the audit: Use the EnterPlanMode tool at the start of the orchestration Plan Mode enables better organization of multi-phase audits It allows the user to validate the approach before execution If Plan Mode is not available, proceed directly with execution Plan Mode provides better traceability and user control ov...
123
10390 tmux mitsuhiko/agent-stuff
tmux Session Control Control tmux sessions by sending keystrokes and reading output. Essential for managing Claude Code sessions. When to Use ✅ USE this skill when: Monitoring Claude/Codex sessions in tmux Sending input to interactive terminal applications Scraping output from long-running processes in tmux Navigating tmux panes/windows programmatically Checking on background work in existing sessions When NOT to Use ❌ DON'T use this skill when: Running one-off shell commands → use exec tool dir...
123
10391 axiom-camera-capture-diag charleswiltgen/axiom
Camera Capture Diagnostics Systematic troubleshooting for AVFoundation camera issues: frozen preview, wrong rotation, slow capture, session interruptions, and permission problems. Overview Core Principle: When camera doesn't work, the problem is usually: Threading (session work on main thread) - 35% Session lifecycle (not started, interrupted, not configured) - 25% Rotation (deprecated APIs, missing coordinator) - 20% Permissions (denied, not requested) - 15% Configuration (wrong preset, mis...
123
10392 axiom-build-debugging charleswiltgen/axiom
Build Debugging Overview Check dependencies BEFORE blaming code. Core principle 80% of persistent build failures are dependency resolution issues (CocoaPods, SPM, framework conflicts), not code bugs. Example Prompts These are real questions developers ask that this skill is designed to answer: 1. "I added a Swift Package but I'm getting 'No such module' errors. The package is in my Xcode project but won't compile." → The skill covers SPM resolution workflows, package cache clearing, and fra...
123
10393 x-article-publisher wshuyi/x-article-publisher-skill
X Article Publisher Publish Markdown content to X (Twitter) Articles editor, preserving formatting with rich text conversion. Prerequisites Playwright MCP for browser automation User logged into X with Premium Plus subscription Python 3.9+ with dependencies: macOS: pip install Pillow pyobjc-framework-Cocoa Windows: pip install Pillow pywin32 clip-util For Mermaid diagrams: npm install -g @mermaid-js/mermaid-cli Scripts Located in ~/.claude/skills/x-article-publisher/scripts/: parse_markdown....
123
10394 ask buiducnhat/agent-skills
Ask Purpose Gather the information needed to proceed with a task through structured, focused dialogue. This skill is for asking only . Do not plan, implement, or produce any artifacts. Scope Gate (Required Before Starting) Use this skill only when: The task is underspecified — key requirements, constraints, or decisions are missing User input is required — the task cannot proceed without answers from the user No assumptions are safe — guessing would risk wasted effort or wrong direction If the t...
123
10395 axiom-grdb charleswiltgen/axiom
GRDB Overview Direct SQLite access using GRDB.swift — a toolkit for SQLite databases with type-safe queries, migrations, and reactive observation. Core principle Type-safe Swift wrapper around raw SQL with full SQLite power when you need it. Requires iOS 13+, Swift 5.7+ License MIT (free and open source) When to Use GRDB Use raw GRDB when you need ✅ Complex SQL joins across 4+ tables ✅ Window functions (ROW_NUMBER, RANK, LAG/LEAD) ✅ Reactive queries with ValueObservation ✅ Full control over ...
123
10396 svg-precision dkyazzentwatwa/chatgpt-skills
svg-precision Generate structurally correct SVGs from a strict JSON spec (scene graph), then validate and optionally render a PNG preview. Fast path Turn the user request into a Spec JSON (use templates in references/spec.md). Build the SVG: python scripts/svg_cli.py build spec.json out.svg Validate: python scripts/svg_cli.py validate out.svg (Optional) Render a PNG preview (requires CairoSVG): python scripts/svg_cli.py render out.svg out.png --scale 2 Spec design rules (for accuracy) Always s...
123
10397 x-twitter-api tato418/x-twitter-api
X (Twitter) API v2 skill using the authenticated user's own developer credentials (OAuth 1.0a, pay-per-use). All commands go through a single entry point: node x.js <command> [flags] . Each command has its its own doc file with the full reference for flags and behavior. [!SETUP] Before first use, check whether ./node_modules exists. If it does NOT exist, run npm install . Then check whether ./dist/x.js exists. If it does NOT exist, run npm run build . NEVER cd into the skill directory; use relat...
123
10398 voice-agents casper-studios/casper-marketplace
Voice Agents You are a voice AI architect who has shipped production voice agents handling millions of calls. You understand the physics of latency - every component adds milliseconds, and the sum determines whether conversations feel natural or awkward. Your core insight: Two architectures exist. Speech-to-speech (S2S) models like OpenAI Realtime API preserve emotion and achieve lowest latency but are less controllable. Pipeline architectures (STT→LLM→TTS) give you control at each step but ad...
123
10399 macos-developer 404kidwiz/claude-supercode-skills
macOS Developer Purpose Provides native macOS application development expertise specializing in AppKit, SwiftUI for Mac, and system integration. Builds native desktop applications with XPC services, menu bar apps, and deep OS capabilities for the Apple ecosystem. When to Use Building native macOS apps (DMG/App Store) Developing Menu Bar apps (NSStatusItem) Implementing XPC Services for privilege separation Creating System Extensions (Endpoint Security, Network Extension) Porting iPad apps to M...
123
10400 multimodal-analysis 404kidwiz/claude-supercode-skills
You are an expert at analyzing and interpreting diverse media formats, extracting meaningful insights from visual content, technical diagrams, documents, and complex visual information that goes beyond simple text extraction. Purpose Provide sophisticated analysis of media files by understanding visual context, recognizing patterns, interpreting diagrams, and extracting structured information from unstructured visual content. You excel at transforming visual media into actionable, interpreted...
123