AI Completion Adapters

Used for AI-powered text completion.

Feel free to fork and implement other models, including Anthropic, Google Gemini, or any other AI model that supports text completion.

Common `complete()` signature

All completion adapters implement the same complete() method signature from the shared CompletionAdapter interface.

complete({
  content: string,
  maxTokens: number,
  outputSchema?: any,
  reasoningEffort?: 'none' | 'minimal' | 'low' | 'medium' | 'high' | 'xhigh',
  tools?: CompletionTool[],
  onChunk?: (
    chunk: string,
    event?: {
      type: 'output' | 'reasoning';
      delta: string;
      text: string;
      source?: 'summary' | 'text';
    },
  ) => void | Promise<void>,
})

reasoningEffort is optional and defaults to low in the adapter implementation.

Using `json_schema`

All completion adapters accept structured output through the shared outputSchema field in the request object. The example below uses the OpenAI adapter, but the same request shape is common for all completion adapters.

const adapter = new CompletionAdapterOpenAIResponses({
  openAiApiKey: process.env.OPENAI_API_KEY as string,
  model: 'gpt-5-mini',
});

const prompt = 'What is the capital of France? return json';

adapter.complete({
  content: prompt,
  maxTokens: 200,
  outputSchema: {
    name: 'capital_response',
    schema: {
      type: 'object',
      properties: {
        capital: { type: 'string' },
      },
      required: ['capital'],
    },
  },
}).then((resp) => {
  console.log(resp);
});

Using streaming output and reasoning events

All completion adapters accept onChunk inside the request object. The example below uses the OpenAI adapter, but the callback signature is common for all completion adapters.

Streaming reasoning events depend on the selected model and provider response.

const adapter = new CompletionAdapterOpenAIResponses({
  openAiApiKey: process.env.OPENAI_API_KEY as string,
  model: 'gpt-5-mini',
});

await adapter.complete({
  content: 'Think step by step and write a short answer about how ABS brakes work',
  maxTokens: 300,
  reasoningEffort: 'medium',
  onChunk: async (chunk, event) => {
    if (!event) return;

    if (event.type === 'reasoning') {
      console.log('Reasoning chunk:', event.delta);
      return;
    }

    console.log('Output chunk:', event.delta);
  },
});

Then output will be like:

{ content: '{"capital":"Paris"}', finishReason: 'stop' }

OpenAI Responses Completion Adapter

pnpm i @adminforth/completion-adapter-openai-responses

Integrates AdminForth with OpenAI's Responses API to provide AI-powered completion and conversational features.

The older @adminforth/completion-adapter-open-ai-chat-gpt package is deprecated. Use this package instead.

import CompletionAdapterOpenAIResponses from '@adminforth/completion-adapter-openai-responses';

new CompletionAdapterOpenAIResponses({
  openAiApiKey: process.env.OPENAI_API_KEY as string,
  model: 'gpt-5.2',
  extraRequestBodyParameters: {
    temperature: 0.7,
    reasoning: {
      effort: 'medium',
    },
  },
}),

You can specify any GPT model you need. The default is gpt-5-nano as it is cheapest though may behave weakly.

By default, this adapter uses the OpenAI responses API (v1/responses) unless useComplitionApi is set to true. If useComplitionApi is true, it uses the older Chat Completions API (v1/chat/completions).

Using with OpenAI-compatible API providers (for example based on self-hosted vLLM docker images)

Adapter can run OpenAI-compatible providers by setting baseUrl, for example you can use OVH AI Endpoints:

new CompletionAdapterOpenAIResponses({
  openAiApiKey: process.env.OVH_AI_ENDPOINTS_ACCESS_TOKEN as string,
  baseUrl: 'https://oai.endpoints.kepler.ai.cloud.ovh.net/v1',
  model: 'gpt-oss-20b',
  useComplitionApi: true,
  extraRequestBodyParameters: {
    store: false,
  },
}),

If useComplitionApi is omitted, the adapter keeps the current default behavior:

official OpenAI uses the responses API
custom baseUrl providers use the Chat Completions API.

This is because many OpenAI-compatible providers do not yet support the responses API or support it unstably (for example OVH AI Endpoints still - Apr 2026 does not fully support the responses, so useComplitionApi: false may work unstably there, though you can re-test it by manually enabling it by setting useComplitionApi: true and checking if it works). Any 3rd-party API providers might have next reasones of pure responses API compatibility:

If they use vLLM open-source software under the hood they might have outdated version
Custom non-vLLM implmentation might have reliable chat API implementation while give less priority to responses API as it is more complex and new.

We recommend you to try responses API first by setting false in useComplitionApi because it gives rich features set, including summarization and better structured outputs, and if it does not work for your provider, then set it to true to have less features but working implementation.

Google Gemini Completion Adapter

pnpm i @adminforth/completion-adapter-google-gemini

Integrates AdminForth with Google Gemini models to provide AI-powered completion and conversational features.

import CompletionAdapterGoogleGemini from '@adminforth/completion-adapter-google-gemini';

new CompletionAdapterGoogleGemini({
  geminiApiKey: process.env.GEMINI_API_KEY as string,
  model: 'gemini-3-pro-preview',
  extraRequestBodyParameters: {
    temperature: 0.7,
  },
}),

You can specify any Gemini model you need. The default is gemini-3-flash-preview.

Adding extra request body params

There might be cases where you want to add extra body params to the request sent to the AI provider. For those cases you can use extraRequestBodyParameters:

import CompletionAdapterGoogleGemini from '@adminforth/completion-adapter-google-gemini';

new CompletionAdapterGoogleGemini({
  geminiApiKey: process.env.GEMINI_API_KEY as string,
  model: 'gemini-3-pro-preview',
  extraRequestBodyParameters: {
    temperature: 0.7,
    responseMimeType: 'application/json',
  },
}),

Common complete() signature​

Using json_schema​

Using streaming output and reasoning events​

OpenAI Responses Completion Adapter​

Using with OpenAI-compatible API providers (for example based on self-hosted vLLM docker images)​

Google Gemini Completion Adapter​

Adding extra request body params​