Inspect and validate Vertex AI Agent Engine deployments including Code Execution Sandbox, Memory Bank, A2A protocol compliance, and security posture. Generates production readiness scores. Use when asked to "inspect agent engine" or "validate depl...
Limited to specific tools
Additional assets for this skill
This skill is limited to using the following tools:
scripts/check-security.pyscripts/inspect-agent.shExpert inspector for the Vertex AI Agent Engine managed runtime. Performs comprehensive validation of deployed agents including runtime configuration, security posture, performance settings, A2A protocol compliance, and production readiness scoring.
Critical Checks:
ā
State TTL between 7-14 days (optimal for production)
ā
Sandbox type is SECURE_ISOLATED
ā
IAM permissions limited to required GCP services only
ā
Timeout configured appropriately
ā ļø State TTL < 7 days may cause premature session loss
ā State TTL > 14 days not allowed by Agent Engine
Critical Checks:
ā
Max memories >= 100 (prevents conversation truncation)
ā
Indexing enabled (fast query performance)
ā
Auto-cleanup enabled (prevents quota exhaustion)
ā
Encrypted at rest (Firestore default)
ā ļø Low memory limit may truncate long conversations
/.well-known/agent-cardPOST /v1/tasks:send responds correctlyGET /v1/tasks/{task_id} accessibleCompliance Report:
ā
AgentCard accessible and valid
ā
Task submission API functional
ā
Status polling API functional
ā
Protocol version 1.0
ā Missing AgentCard fields: [...]
ā Task API not responding (check IAM/networking)
Security Score:
š¢ SECURE (90-100%): Production ready
š” NEEDS ATTENTION (70-89%): Address issues before prod
š“ INSECURE (<70%): Do not deploy to production
Health Status:
š¢ HEALTHY: Error rate < 5%, latency < 3s (p95)
š” DEGRADED: Error rate 5-10% or latency 3-5s
š“ UNHEALTHY: Error rate > 10% or latency > 5s
Observability Score:
ā
All 5 pillars configured: Metrics, Logs, Traces, Alerts, Dashboards
ā ļø Missing alerts for critical scenarios
ā No monitoring configured (production blocker)
| Category | Weight | Checks |
|---|---|---|
| Security | 30% | 6 checks (IAM, VPC-SC, encryption, etc.) |
| Performance | 25% | 6 checks (scaling, limits, SLOs, etc.) |
| Monitoring | 20% | 6 checks (dashboards, alerts, logs, etc.) |
| Compliance | 15% | 5 checks (audit logs, DR, privacy, etc.) |
| Reliability | 10% | 5 checks (multi-region, failover, etc.) |
š¢ PRODUCTION READY (85-100%)
- All critical checks passed
- Minor optimizations recommended
- Safe to deploy
š” NEEDS IMPROVEMENT (70-84%)
- Some important checks failed
- Address issues before production
- Staging deployment acceptable
š“ NOT READY (<70%)
- Critical failures present
- Do not deploy to production
- Fix blocking issues first
1. Connect to Agent Engine
2. Retrieve agent metadata
3. Parse runtime configuration
4. Extract Code Execution settings
5. Extract Memory Bank settings
6. Document VPC configuration
1. Test AgentCard endpoint
2. Validate AgentCard structure
3. Test Task API (POST /v1/tasks:send)
4. Test Status API (GET /v1/tasks/{id})
5. Verify A2A protocol version
1. Review IAM roles and permissions
2. Check VPC Service Controls
3. Validate encryption settings
4. Scan for hardcoded secrets
5. Verify Model Armor enabled
6. Assess service account security
1. Query Cloud Monitoring metrics
2. Calculate error rate (last 24h)
3. Analyze latency percentiles
4. Review token usage and costs
5. Check auto-scaling behavior
6. Validate resource limits
1. Run all checklist items (28 checks)
2. Calculate category scores
3. Calculate overall score
4. Determine readiness status
5. Generate recommendations
6. Create action plan
Read-only inspection - Cannot modify configurations:
Agent ID: gcp-deployer-agent
Deployment Status: RUNNING
Inspection Date: 2025-12-09
Runtime Configuration:
Model: gemini-2.5-flash
Code Execution: ā
Enabled (TTL: 14 days)
Memory Bank: ā
Enabled (retention: 90 days)
VPC: ā
Configured (private-vpc-prod)
A2A Protocol Compliance:
AgentCard: ā
Valid
Task API: ā
Functional
Status API: ā
Functional
Protocol Version: 1.0
Security Posture:
IAM: ā
Least privilege (score: 95%)
VPC-SC: ā
Enabled
Model Armor: ā
Enabled
Encryption: ā
At-rest & in-transit
Overall: š¢ SECURE (92%)
Performance Metrics (24h):
Request Count: 12,450
Error Rate: 2.3% š¢
Latency (p95): 1,850ms š¢
Token Usage: 450K tokens
Cost Estimate: $12.50/day
Production Readiness:
Security: 92% (28/30 points)
Performance: 88% (22/25 points)
Monitoring: 95% (19/20 points)
Compliance: 80% (12/15 points)
Reliability: 70% (7/10 points)
Overall Score: 87% š¢ PRODUCTION READY
Recommendations:
1. Enable multi-region deployment (reliability +10%)
2. Configure automated backups (compliance +5%)
3. Add circuit breaker pattern (reliability +5%)
4. Optimize memory bank indexing (performance +3%)
Inspector checks:
Inspector checks:
Inspector checks: