testing
| rank | capability | source |
|---|---|---|
| #101 | quarantine > Agent for `helm-chart-quality-review`. Reviews Helm chart source for quality, security, and testability defects — linting gaps, insecure securityContext, missing resource limits, absent health probes, RBAC over-permission, hardcoded secrets, and missing helm test coverage —… | Raishin/vanguard-frontier-agentic |
| #102 | quarantine > Agent for `llm-ai-pipeline-test-review`. Reviews an LLM or AI pipeline's evaluation setup for test-quality defects — missing hallucination, relevancy, faithfulness, bias, toxicity, and tool-correctness metrics; absent golden datasets; unthresholded or single-shot evals; and no… | Raishin/vanguard-frontier-agentic |
| #103 | quarantine > Agent for `playwright-e2e-execution-run`. Executes an existing Playwright E2E suite against an operator-confirmed non-production target and emits a structured run attestation. Read-only-runtime tier — default mode is static and runs nothing. | Raishin/vanguard-frontier-agentic |
| #104 | quarantine > Agent for `playwright-e2e-suite-review`. Reviews Playwright spec files, `playwright.config`, and CI workflows for flakiness, selector brittleness, test isolation defects, retry masking, and CI reliability. | Raishin/vanguard-frontier-agentic |
| #105 | quarantine > Agent for `plc-control-logic-safety-review`. Statically reviews exported IEC 61131-3 PLC program logic (Ladder Diagram, Structured Text, Function Block Diagram, Sequential Function Chart) for safety and reliability defects — E-stop implementation, output fail-safe paths, latch… | Raishin/vanguard-frontier-agentic |
| #106 | quarantine > Agent for `test-coverage-quality-review`. Reviews a test suite for assertion quality over coverage percentage — detecting coverage theater, assertion-free and tautological tests, mock over-specification, untested branches, and weak coverage gates. | Raishin/vanguard-frontier-agentic |
| #107 | quarantine > Agent for `test-flakiness-triage`. Triages flaky tests across any framework into root-cause categories, assigns a quarantine or fix path per test, and audits CI retry configuration and quarantine policy. | Raishin/vanguard-frontier-agentic |
| #108 | quarantine Use this skill when reviewing cert-manager PKI configuration for Kubernetes clusters. Trigger when the user asks about Issuer or ClusterIssuer scope, CertificateRequestPolicy coverage, certificate SAN or duration risks, trust-manager bundle distribution, SPIFFE mesh CA… | Raishin/vanguard-frontier-agentic |
| #109 | quarantine Normalize vendor-specific billing rows (AWS CUR, Azure Cost Management, GCP Billing Export, OCI) into FOCUS v1.2 columns from user-pasted CSV or JSON input. Refuses to invent column values not derivable from the input. No credentials accepted; operates on user-supplied data only. | Raishin/vanguard-frontier-agentic |
| #110 | quarantine Use this skill when reviewing marketing email list segment metadata, consent-record completeness, suppression-list coverage, and documented data-retention schedules for GDPR storage-limitation, CASL record-keeping, and CCPA deletion-right compliance. Trigger when a user provides… | Raishin/vanguard-frontier-agentic |
| #111 | quarantine Use this skill when reviewing how a CI pipeline runs tests — gating, sharding, parallelism, fail-fast behavior, artifact retention, and flaky-test quarantine wiring. Trigger when a user provides a CI workflow file (GitHub Actions, GitLab CI, CircleCI, Jenkins), asks why CI is… | Raishin/vanguard-frontier-agentic |
| #112 | quarantine Use this skill when reviewing how an LLM or AI pipeline is evaluated — metric selection, golden datasets, threshold governance, adversarial coverage, and regression gating — to determine whether low-quality or unsafe model outputs can ship undetected. Trigger when a user… | Raishin/vanguard-frontier-agentic |
| #113 | quarantine Use this skill when an operator wants to actually execute an existing Playwright end-to-end suite against a confirmed non-production target and receive a structured, attested run report — pass/fail counts, flaky tests, durations, and trace artifacts. Trigger when the user asks… | Raishin/vanguard-frontier-agentic |
| #114 | quarantine Use this skill when reviewing a Playwright end-to-end test suite for flakiness, selector brittleness, isolation defects, and CI reliability. Trigger when a user provides Playwright spec files, a playwright.config.ts/js, a CI workflow that runs Playwright, or asks why their E2E… | Raishin/vanguard-frontier-agentic |
| #115 | quarantine Use this skill when reviewing exported PLC program logic (Ladder Diagram, Structured Text, Function Block Diagram, or Sequential Function Chart) for safety and reliability defects. Trigger when a user provides exported IEC 61131-3 program source, an I/O list, a safety… | Raishin/vanguard-frontier-agentic |
| #116 | quarantine Use this skill when reviewing a test suite for assertion quality rather than coverage percentage — detecting coverage theater, where line/branch coverage is high but the tests would not catch a regression. Trigger when a user provides test files, a coverage report, or asks… | Raishin/vanguard-frontier-agentic |
| #117 | quarantine Use this skill when triaging flaky tests across any test framework — analyzing a flaky-test report, CI rerun history, or a set of intermittently failing tests to assign each a root-cause category and a remediation or quarantine path. Trigger when a user reports tests that pass… | Raishin/vanguard-frontier-agentic |
| #118 | quarantine When the user wants to plan, design, or implement an A/B test or experiment. Also use when the user mentions "A/B test," "split test," "experiment," "test this change," "variant copy," "multivariate test," or "hypothesis." For tracking implementation, see analytics-tracking. | TerminalSkills/skills |
| #119 | quarantine Generates and executes load test scripts for APIs using k6, wrk, or autocannon. Creates realistic test scenarios from OpenAPI specs, route files, or endpoint descriptions. Use when someone needs to load test, stress test, benchmark, or find the breaking point of their API.… | TerminalSkills/skills |
| #120 | quarantine Test REST and GraphQL API endpoints with structured assertions and reporting. Use when a user asks to test an API, hit an endpoint, check if an API works, validate a response, debug an API call, test authentication flows, or verify API contracts. Supports GET, POST, PUT, PATCH,… | TerminalSkills/skills |
| #121 | quarantine Assists with end-to-end testing of web applications using Cypress. Use when writing E2E tests, setting up component testing, configuring CI pipelines with parallelization, or building custom test commands. Trigger words: cypress, e2e testing, end-to-end, cypress run, cy.get,… | TerminalSkills/skills |
| #122 | quarantine Jest is a comprehensive JavaScript testing framework built by Meta, designed for zero-configuration testing of JavaScript and TypeScript applications. It provides a complete ecosystem for unit testing, integration testing, and snapshot testing with built-in code coverage,… | TerminalSkills/skills |
| #123 | quarantine When the user wants to perform load testing using Python with Locust's distributed architecture and real-time web UI. Also use when the user mentions "locust," "Python load testing," "distributed load test," "locust web UI," or "locustfile." For JavaScript-based load testing,… | TerminalSkills/skills |
| #124 | quarantine Write and maintain end-to-end tests with Playwright. Use when someone asks to "add e2e tests", "test my web app", "set up Playwright", "write browser tests", "test login flow", "visual regression testing", "test across browsers", or "automate UI testing". Covers test setup, page… | TerminalSkills/skills |
| #125 | quarantine Design, test, and iterate on AI prompts systematically using structured evaluation criteria. Use when building AI features, optimizing agent instructions, comparing prompt variants, or evaluating output quality across edge cases. Trigger words: prompt engineering, prompt… | TerminalSkills/skills |