Files
memex/notes/org-skill-provider-gemini.org

2.4 KiB

SKILL: Gemini Provider Agent (Universal Literate Note)

Overview

The Gemini Provider Agent provides a dual-path interface to Google's neural models. It supports both the low-latency Developer API and the high-fidelity Web/Pro interface.

Phase A: Demand (PRD)

1. Purpose

Enable a hybrid cost/performance strategy for Google's models.

2. User Needs

  • API Path: Reliable, programmatic access for system reflexes.
  • Web Path: Zero-cost access to Pro-tier reasoning via browser session bridging.

Phase B: Blueprint (PROTOCOL)

1. Architectural Intent

Expose two distinct backends to the kernel: :gemini-api and :gemini-web.

2. Semantic Interfaces

`execute-gemini-api-request`

:signature `(execute-gemini-api-request prompt system-prompt &key model) :string` :description "Direct call to the Google AI Studio API."

`execute-gemini-web-request`

:signature `(execute-gemini-web-request prompt system-prompt) :string` :description "Routes the request through the Web Research browser proxy."

Phase D: Build (Implementation)

API Implementation

(in-package :org-agent)

(defun execute-gemini-api-request (prompt system-prompt &key model)
  "Implementation uses the standard kernel execute-gemini-request logic."
  (org-agent::execute-gemini-request prompt system-prompt :model model))

Web Implementation

(defun execute-gemini-web-request (prompt system-prompt &key model)
  (declare (ignore model))
  "Dispatches to the browser-based Web Research skill."
  (let ((full-prompt (format nil "~a~%~%Prompt: ~a" system-prompt prompt)))
    (uiop:symbol-call :org-agent.skills.org-skill-web-research :ask-gemini-web full-prompt)))

Registration

(progn
  (org-agent:register-neuro-backend :gemini-api #'execute-gemini-api-request)
  (org-agent:register-neuro-backend :gemini-web #'execute-gemini-web-request)
  
  (defskill :skill-provider-gemini
    :priority 90
    :trigger (lambda (context) nil)
    :neuro (lambda (context) nil)
    :symbolic (lambda (action context) action)))