llama.rn/docs/API at main · ArkaneFans/llama.rn

Name	Name	Last commit message	Last commit date
parent directory ..
classes	classes
interfaces	interfaces
README.md	README.md

llama.rn

Classes

Interfaces

SchemaGrammarConverterPropOrder

Type Aliases

Variables

RNLLAMA_MTMD_DEFAULT_MEDIA_MARKER

Functions

Type Aliases

BenchResult

Ƭ BenchResult: Object

Type declaration

Name	Type
`modelDesc`	`string`
`modelNParams`	`number`
`modelSize`	`number`
`ppAvg`	`number`
`ppStd`	`number`
`tgAvg`	`number`
`tgStd`	`number`

Defined in

index.ts:161

CompletionBaseParams

Ƭ CompletionBaseParams: Object

Type declaration

Name	Type
`chatTemplate?`	`string`
`chat_template?`	`string`
`jinja?`	`boolean`
`media_paths?`	`string` \| `string`[]
`messages?`	`RNLlamaOAICompatibleMessage`[]
`parallel_tool_calls?`	`object`
`prompt?`	`string`
`response_format?`	`CompletionResponseFormat`
`tool_choice?`	`string`
`tools?`	`object`

Defined in

index.ts:143

CompletionParams

Ƭ CompletionParams: Omit<NativeCompletionParams, "emit_partial_completion" | "prompt"> & CompletionBaseParams

Defined in

index.ts:155

CompletionResponseFormat

Ƭ CompletionResponseFormat: Object

Type declaration

Name	Type
`json_schema?`	{ `schema`: `object` ; `strict?`: `boolean` }
`json_schema.schema`	`object`
`json_schema.strict?`	`boolean`
`schema?`	`object`
`type`	`"text"` \| `"json_object"` \| `"json_schema"`

Defined in

index.ts:134

ContextParams

Ƭ ContextParams: Omit<NativeContextParams, "cache_type_k" | "cache_type_v" | "pooling_type"> & { cache_type_k?: "f16" | "f32" | "q8_0" | "q4_0" | "q4_1" | "iq4_nl" | "q5_0" | "q5_1" ; cache_type_v?: "f16" | "f32" | "q8_0" | "q4_0" | "q4_1" | "iq4_nl" | "q5_0" | "q5_1" ; pooling_type?: "none" | "mean" | "cls" | "last" | "rank" }

Defined in

index.ts:107

EmbeddingParams

Ƭ EmbeddingParams: NativeEmbeddingParams

Defined in

index.ts:132

FormattedChatResult

Ƭ FormattedChatResult: Object

Type declaration

Name	Type
`has_media`	`boolean`
`media_paths?`	`string`[]
`prompt`	`string`
`type`	`"jinja"` \| `"llama-chat"`

Defined in

NativeRNLlama.ts:371

JinjaFormattedChatResult

Ƭ JinjaFormattedChatResult: FormattedChatResult & { additional_stops?: string[] ; chat_format?: number ; grammar?: string ; grammar_lazy?: boolean ; grammar_triggers?: { token: number ; type: number ; value: string }[] ; preserved_tokens?: string[] }

Defined in

NativeRNLlama.ts:378

NativeCompletionParams

Ƭ NativeCompletionParams: Object

Type declaration

Name	Type	Description
`chat_format?`	`number`	-
`dry_allowed_length?`	`number`	Tokens that extend repetition beyond this receive exponentially increasing penalty: multiplier * base ^ (length of repeating sequence before token - allowed length). Default: `2`
`dry_base?`	`number`	Set the DRY repetition penalty base value. Default: `1.75`
`dry_multiplier?`	`number`	Set the DRY (Don't Repeat Yourself) repetition penalty multiplier. Default: `0.0`, which is disabled.
`dry_penalty_last_n?`	`number`	How many tokens to scan for repetitions. Default: `-1`, where `0` is disabled and `-1` is context size.
`dry_sequence_breakers?`	`string`[]	Specify an array of sequence breakers for DRY sampling. Only a JSON array of strings is accepted. Default: `['\n', ':', '"', '*']`
`emit_partial_completion`	`boolean`	-
`grammar?`	`string`	Set grammar for grammar-based sampling. Default: no grammar
`grammar_lazy?`	`boolean`	Lazy grammar sampling, trigger by grammar_triggers. Default: false
`grammar_triggers?`	{ `token`: `number` ; `type`: `number` ; `value`: `string` }[]	Lazy grammar triggers. Default: []
`ignore_eos?`	`boolean`	Ignore end of stream token and continue generating. Default: `false`
`json_schema?`	`string`	JSON schema for convert to grammar for structured JSON output. It will be override by grammar if both are set.
`logit_bias?`	`number`[][]	Modify the likelihood of a token appearing in the generated text completion. For example, use `"logit_bias": [[15043,1.0]]` to increase the likelihood of the token 'Hello', or `"logit_bias": [[15043,-1.0]]` to decrease its likelihood. Setting the value to false, `"logit_bias": [[15043,false]]` ensures that the token `Hello` is never produced. The tokens can also be represented as strings, e.g.`[["Hello, World!",-0.5]]` will reduce the likelihood of all the individual tokens that represent the string `Hello, World!`, just like the `presence_penalty` does. Default: `[]`
`media_paths?`	`string`[]	Path to an image file to process before generating text. When provided, the image will be processed and added to the context. Requires multimodal support to be enabled via initMultimodal.
`min_p?`	`number`	The minimum probability for a token to be considered, relative to the probability of the most likely token. Default: `0.05`
`mirostat?`	`number`	Enable Mirostat sampling, controlling perplexity during text generation. Default: `0`, where `0` is disabled, `1` is Mirostat, and `2` is Mirostat 2.0.
`mirostat_eta?`	`number`	Set the Mirostat learning rate, parameter eta. Default: `0.1`
`mirostat_tau?`	`number`	Set the Mirostat target entropy, parameter tau. Default: `5.0`
`n_predict?`	`number`	Set the maximum number of tokens to predict when generating text. Note: May exceed the set limit slightly if the last token is a partial multibyte character. When 0,no tokens will be generated but the prompt is evaluated into the cache. Default: `-1`, where `-1` is infinity.
`n_probs?`	`number`	If greater than 0, the response also contains the probabilities of top N tokens for each generated token given the sampling settings. Note that for temperature < 0 the tokens are sampled greedily but token probabilities are still being calculated via a simple softmax of the logits without considering any other sampler settings. Default: `0`
`n_threads?`	`number`	-
`penalty_freq?`	`number`	Repeat alpha frequency penalty. Default: `0.0`, which is disabled.
`penalty_last_n?`	`number`	Last n tokens to consider for penalizing repetition. Default: `64`, where `0` is disabled and `-1` is ctx-size.
`penalty_present?`	`number`	Repeat alpha presence penalty. Default: `0.0`, which is disabled.
`penalty_repeat?`	`number`	Control the repetition of token sequences in the generated text. Default: `1.0`
`preserved_tokens?`	`string`[]	-
`prompt`	`string`	-
`seed?`	`number`	Set the random number generator (RNG) seed. Default: `-1`, which is a random seed.
`stop?`	`string`[]	Specify a JSON array of stopping strings. These words will not be included in the completion, so make sure to add them to the prompt for the next iteration. Default: `[]`
`temperature?`	`number`	Adjust the randomness of the generated text. Default: `0.8`
`top_k?`	`number`	Limit the next token selection to the K most probable tokens. Default: `40`
`top_n_sigma?`	`number`	Top n sigma sampling as described in academic paper "Top-nσ: Not All Logits Are You Need" https://arxiv.org/pdf/2411.07641. Default: `-1.0` (Disabled)
`top_p?`	`number`	Limit the next token selection to a subset of tokens with a cumulative probability above a threshold P. Default: `0.95`
`typical_p?`	`number`	Enable locally typical sampling with parameter p. Default: `1.0`, which is disabled.
`xtc_probability?`	`number`	Set the chance for token removal via XTC sampler. Default: `0.0`, which is disabled.
`xtc_threshold?`	`number`	Set a minimum probability threshold for tokens to be removed via XTC sampler. Default: `0.1` (> `0.5` disables XTC)

Defined in

NativeRNLlama.ts:81

NativeCompletionResult

Ƭ NativeCompletionResult: Object

Type declaration

Name	Type	Description
`completion_probabilities?`	`NativeCompletionTokenProb`[]	-
`content`	`string`	Content text (Filtered text by reasoning_content / tool_calls)
`reasoning_content`	`string`	Reasoning content (parsed for reasoning model)
`stopped_eos`	`boolean`	-
`stopped_limit`	`number`	-
`stopped_word`	`string`	-
`stopping_word`	`string`	-
`text`	`string`	Original text (Ignored reasoning_content / tool_calls)
`timings`	`NativeCompletionResultTimings`	-
`tokens_cached`	`number`	-
`tokens_evaluated`	`number`	-
`tokens_predicted`	`number`	-
`tool_calls`	{ `function`: { `arguments`: `string` ; `name`: `string` } ; `id?`: `string` ; `type`: `"function"` }[]	Tool calls
`truncated`	`boolean`	-

Defined in

NativeRNLlama.ts:252

NativeCompletionResultTimings

Ƭ NativeCompletionResultTimings: Object

Type declaration

Name	Type
`predicted_ms`	`number`
`predicted_n`	`number`
`predicted_per_second`	`number`
`predicted_per_token_ms`	`number`
`prompt_ms`	`number`
`prompt_n`	`number`
`prompt_per_second`	`number`
`prompt_per_token_ms`	`number`

Defined in

NativeRNLlama.ts:241

NativeCompletionTokenProb

Ƭ NativeCompletionTokenProb: Object

Type declaration

Name	Type
`content`	`string`
`probs`	`NativeCompletionTokenProbItem`[]

Defined in

NativeRNLlama.ts:236

NativeCompletionTokenProbItem

Ƭ NativeCompletionTokenProbItem: Object

Type declaration

Name	Type
`prob`	`number`
`tok_str`	`string`

Defined in

NativeRNLlama.ts:231

NativeContextParams

Ƭ NativeContextParams: Object

Type declaration

Name	Type	Description
`cache_type_k?`	`string`	KV cache data type for the K (Experimental in llama.cpp)
`cache_type_v?`	`string`	KV cache data type for the V (Experimental in llama.cpp)
`chat_template?`	`string`	Chat template to override the default one from the model.
`ctx_shift?`	`boolean`	Enable context shifting to handle prompts larger than context size
`embd_normalize?`	`number`	-
`embedding?`	`boolean`	-
`flash_attn?`	`boolean`	Enable flash attention, only recommended in GPU device (Experimental in llama.cpp)
`is_model_asset?`	`boolean`	-
`lora?`	`string`	Single LoRA adapter path
`lora_list?`	{ `path`: `string` ; `scaled?`: `number` }[]	LoRA adapter list
`lora_scaled?`	`number`	Single LoRA adapter scale
`model`	`string`	-
`n_batch?`	`number`	-
`n_ctx?`	`number`	-
`n_gpu_layers?`	`number`	Number of layers to store in VRAM (Currently only for iOS)
`n_threads?`	`number`	-
`n_ubatch?`	`number`	-
`no_gpu_devices?`	`boolean`	Skip GPU devices (iOS only)
`pooling_type?`	`number`	-
`reasoning_format?`	`string`	-
`rope_freq_base?`	`number`	-
`rope_freq_scale?`	`number`	-
`use_mlock?`	`boolean`	-
`use_mmap?`	`boolean`	-
`use_progress_callback?`	`boolean`	-
`vocab_only?`	`boolean`	-

Defined in

NativeRNLlama.ts:8

NativeEmbeddingParams

Ƭ NativeEmbeddingParams: Object

Type declaration

Name	Type
`embd_normalize?`	`number`

Defined in

NativeRNLlama.ts:4

NativeEmbeddingResult

Ƭ NativeEmbeddingResult: Object

Type declaration

Name	Type
`embedding`	`number`[]

Defined in

NativeRNLlama.ts:310

NativeImageProcessingResult

Ƭ NativeImageProcessingResult: Object

Type declaration

Name	Type
`error?`	`string`
`prompt`	`string`
`success`	`boolean`

Defined in

NativeRNLlama.ts:391

NativeLlamaContext

Ƭ NativeLlamaContext: Object

Type declaration

Name	Type	Description
`androidLib?`	`string`	Loaded library name for Android
`contextId`	`number`	-
`gpu`	`boolean`	-
`model`	{ `chatTemplates`: { `llamaChat`: `boolean` ; `minja`: { `default`: `boolean` ; `defaultCaps`: { `parallelToolCalls`: `boolean` ; `systemRole`: `boolean` ; `toolCallId`: `boolean` ; `toolCalls`: `boolean` ; `toolResponses`: `boolean` ; `tools`: `boolean` } ; `toolUse`: `boolean` ; `toolUseCaps`: { `parallelToolCalls`: `boolean` ; `systemRole`: `boolean` ; `toolCallId`: `boolean` ; `toolCalls`: `boolean` ; `toolResponses`: `boolean` ; `tools`: `boolean` } } } ; `desc`: `string` ; `isChatTemplateSupported`: `boolean` ; `metadata`: `Object` ; `nEmbd`: `number` ; `nParams`: `number` ; `size`: `number` }	-
`model.chatTemplates`	{ `llamaChat`: `boolean` ; `minja`: { `default`: `boolean` ; `defaultCaps`: { `parallelToolCalls`: `boolean` ; `systemRole`: `boolean` ; `toolCallId`: `boolean` ; `toolCalls`: `boolean` ; `toolResponses`: `boolean` ; `tools`: `boolean` } ; `toolUse`: `boolean` ; `toolUseCaps`: { `parallelToolCalls`: `boolean` ; `systemRole`: `boolean` ; `toolCallId`: `boolean` ; `toolCalls`: `boolean` ; `toolResponses`: `boolean` ; `tools`: `boolean` } } }	-
`model.chatTemplates.llamaChat`	`boolean`	-
`model.chatTemplates.minja`	{ `default`: `boolean` ; `defaultCaps`: { `parallelToolCalls`: `boolean` ; `systemRole`: `boolean` ; `toolCallId`: `boolean` ; `toolCalls`: `boolean` ; `toolResponses`: `boolean` ; `tools`: `boolean` } ; `toolUse`: `boolean` ; `toolUseCaps`: { `parallelToolCalls`: `boolean` ; `systemRole`: `boolean` ; `toolCallId`: `boolean` ; `toolCalls`: `boolean` ; `toolResponses`: `boolean` ; `tools`: `boolean` } }	-
`model.chatTemplates.minja.default`	`boolean`	-
`model.chatTemplates.minja.defaultCaps`	{ `parallelToolCalls`: `boolean` ; `systemRole`: `boolean` ; `toolCallId`: `boolean` ; `toolCalls`: `boolean` ; `toolResponses`: `boolean` ; `tools`: `boolean` }	-
`model.chatTemplates.minja.defaultCaps.parallelToolCalls`	`boolean`	-
`model.chatTemplates.minja.defaultCaps.systemRole`	`boolean`	-
`model.chatTemplates.minja.defaultCaps.toolCallId`	`boolean`	-
`model.chatTemplates.minja.defaultCaps.toolCalls`	`boolean`	-
`model.chatTemplates.minja.defaultCaps.toolResponses`	`boolean`	-
`model.chatTemplates.minja.defaultCaps.tools`	`boolean`	-
`model.chatTemplates.minja.toolUse`	`boolean`	-
`model.chatTemplates.minja.toolUseCaps`	{ `parallelToolCalls`: `boolean` ; `systemRole`: `boolean` ; `toolCallId`: `boolean` ; `toolCalls`: `boolean` ; `toolResponses`: `boolean` ; `tools`: `boolean` }	-
`model.chatTemplates.minja.toolUseCaps.parallelToolCalls`	`boolean`	-
`model.chatTemplates.minja.toolUseCaps.systemRole`	`boolean`	-
`model.chatTemplates.minja.toolUseCaps.toolCallId`	`boolean`	-
`model.chatTemplates.minja.toolUseCaps.toolCalls`	`boolean`	-
`model.chatTemplates.minja.toolUseCaps.toolResponses`	`boolean`	-
`model.chatTemplates.minja.toolUseCaps.tools`	`boolean`	-
`model.desc`	`string`	-
`model.isChatTemplateSupported`	`boolean`	-
`model.metadata`	`Object`	-
`model.nEmbd`	`number`	-
`model.nParams`	`number`	-
`model.size`	`number`	-
`reasonNoGPU`	`string`	-

Defined in

NativeRNLlama.ts:314

NativeSessionLoadResult

Ƭ NativeSessionLoadResult: Object

Type declaration

Name	Type
`prompt`	`string`
`tokens_loaded`	`number`

Defined in

NativeRNLlama.ts:356

NativeTokenizeResult

Ƭ NativeTokenizeResult: Object

Type declaration

Name	Type	Description
`bitmap_hashes`	`number`[]	Bitmap hashes of the images
`chunk_pos`	`number`[]	Chunk positions of the text and images
`chunk_pos_images`	`number`[]	Chunk positions of the images
`has_images`	`boolean`	Whether the tokenization contains images
`tokens`	`number`[]	-

Defined in

NativeRNLlama.ts:290

RNLlamaMessagePart

Ƭ RNLlamaMessagePart: Object

Type declaration

Name	Type
`image_url?`	{ `url?`: `string` }
`image_url.url?`	`string`
`input_audio?`	{ `data?`: `string` ; `format`: `string` ; `url?`: `string` }
`input_audio.data?`	`string`
`input_audio.format`	`string`
`input_audio.url?`	`string`
`text?`	`string`
`type`	`string`

Defined in

index.ts:27

RNLlamaOAICompatibleMessage

Ƭ RNLlamaOAICompatibleMessage: Object

Type declaration

Name	Type
`content?`	`string` \| `RNLlamaMessagePart`[]
`role`	`string`

Defined in

index.ts:40

TokenData

Ƭ TokenData: Object

Type declaration

Name	Type
`completion_probabilities?`	`NativeCompletionTokenProb`[]
`token`	`string`

Defined in

index.ts:97

Variables

RNLLAMA_MTMD_DEFAULT_MEDIA_MARKER

• Const RNLLAMA_MTMD_DEFAULT_MEDIA_MARKER: "<__media__>"

Defined in

index.ts:66

Functions

addNativeLogListener

▸ addNativeLogListener(listener): Object

Parameters

Name	Type
`listener`	(`level`: `string`, `text`: `string`) => `void`

Returns

Object

Name	Type
`remove`	() => `void`

Defined in

index.ts:540

convertJsonSchemaToGrammar

▸ convertJsonSchemaToGrammar(«destructured»): string | Promise<string>

Parameters

Name	Type
`«destructured»`	`Object`
› `allowFetch?`	`boolean`
› `dotall?`	`boolean`
› `propOrder?`	`SchemaGrammarConverterPropOrder`
› `schema`	`any`

Returns

string | Promise<string>

Defined in

grammar.ts:829

initLlama

▸ initLlama(«destructured», onProgress?): Promise<LlamaContext>

Parameters

Name	Type
`«destructured»`	`ContextParams`
`onProgress?`	(`progress`: `number`) => `void`

Returns

Promise<LlamaContext>

Defined in

index.ts:581

loadLlamaModelInfo

▸ loadLlamaModelInfo(model): Promise<Object>

Parameters

Name	Type
`model`	`string`

Returns

Promise<Object>

Defined in

index.ts:566

releaseAllLlama

▸ releaseAllLlama(): Promise<void>

Returns

Promise<void>

Defined in

index.ts:647

setContextLimit

▸ setContextLimit(limit): Promise<void>

Parameters

Name	Type
`limit`	`number`

Returns

Promise<void>

Defined in

index.ts:551

toggleNativeLog

▸ toggleNativeLog(enabled): Promise<void>

Parameters

Name	Type
`enabled`	`boolean`

Returns

Promise<void>

Defined in

index.ts:536

FilesExpand file tree

API

Directory actions

More options

Directory actions

More options

Latest commit

History

API

Folders and files

parent directory

README.md

llama.rn

Table of contents

Classes

Interfaces

Type Aliases

Variables

Functions

Type Aliases

BenchResult

Type declaration

Defined in

CompletionBaseParams

Type declaration

Defined in

CompletionParams

Defined in

CompletionResponseFormat

Type declaration

Defined in

ContextParams

Defined in

EmbeddingParams

Defined in

FormattedChatResult

Type declaration

Defined in

JinjaFormattedChatResult

Defined in

NativeCompletionParams

Type declaration

Defined in

NativeCompletionResult

Type declaration

Defined in

NativeCompletionResultTimings

Type declaration

Defined in

NativeCompletionTokenProb

Type declaration

Defined in

NativeCompletionTokenProbItem

Type declaration

Defined in

NativeContextParams

Type declaration

Defined in

NativeEmbeddingParams

Type declaration

Defined in

NativeEmbeddingResult

Type declaration

Defined in

NativeImageProcessingResult

Type declaration

Defined in

NativeLlamaContext

Type declaration

Defined in

NativeSessionLoadResult

Type declaration

Defined in

NativeTokenizeResult

Type declaration

Defined in

RNLlamaMessagePart

Type declaration

Defined in

RNLlamaOAICompatibleMessage