struct ggma_context ggma_context
Opaque handle to a GGMA inference context.
GGMA_STATUS ggma_generate(struct ggma_context *context, ggma_token *tokens, size_t n_tokens, size_t n_tokens_max, size_t *n_tokens_out)
Generates a sequence of tokens based on the provided prompt tokens.