<!doctype html>
<html lang="en">
  <head>
    <meta charset="utf-8">
    <meta name="viewport" content="width=device-width, initial-scale=1">
    <title>Cradicle Explorer</title>
    <link href="/css/bootstrap/bootstrap.min.css" rel="stylesheet">
    <style>
      .form-control-dark::placeholder {
          color: #aaa;
          opacity: 1;
      }
    </style>
    <link rel="stylesheet" href="/assets/fontawesome/css/all.min.css">
    <link rel="icon" type="image/png" href="/favicon.png">
    <link href="/css/dashboard.css" rel="stylesheet">
  </head>
                <body>
                <header class="navbar navbar-dark sticky-top bg-dark flex-md-nowrap p-0 shadow">
                  <a class="navbar-brand col-md-3 col-lg-2 me-0 px-3 fs-6" href="/">Cradicle Explorer</a>
                  <button class="navbar-toggler position-absolute d-md-none collapsed" type="button" data-bs-toggle="collapse" data-bs-target="#sidebarMenu" aria-controls="sidebarMenu" aria-expanded="false" aria-label="Toggle navigation">
                    <span class="navbar-toggler-icon"></span>
                  </button>
                  <form method="get" action="/cgi-bin/main" style="width:100%;"><input class="form-control form-control-dark w-100 rounded-0 border-0" type="text" name="q" placeholder="Search repos" aria-label="Search"></form>
                  <div class="navbar-nav flex-row">
                    <div class="nav-item text-nowrap">
                      <a class="nav-link px-3 active" href="/cgi-bin/repo?id=z6ysXz6ubEakbjB6aJfE7AXLxGqh">llama.cpp</a>
                    </div>
                  </div>
                </header>
                <div class="container-fluid">
                  <div class="row">
                    <nav id="sidebarMenu" class="col-md-3 col-lg-2 d-md-block bg-dark sidebar collapse">
                      <div class="position-sticky pt-3 sidebar-sticky">
                        <ul class="nav flex-column">
                          <li class="nav-item">
                            <a class="nav-link" href="/cgi-bin/repo?id=z6ysXz6ubEakbjB6aJfE7AXLxGqh">
                              <i class="align-text-bottom fa-solid fa-info"></i>
                              Info
                            </a>
                          </li>
                          <li class="nav-item">
                            <a class="nav-link" href="/cgi-bin/repo?id=z6ysXz6ubEakbjB6aJfE7AXLxGqh&issue=list">
                              <i class="align-text-bottom fa-solid fa-layer-group"></i>
                              Issues
                            </a>
                          </li>
                          <li class="nav-item">
                            <a class="nav-link" href="/cgi-bin/repo?id=z6ysXz6ubEakbjB6aJfE7AXLxGqh&patch=list">
                              <i class="align-text-bottom fa-solid fa-vest-patches"></i>
                              Patches
                            </a>
                          </li>
                          <li class="nav-item">
                            <a class="nav-link" href="/cgi-bin/repo?id=z6ysXz6ubEakbjB6aJfE7AXLxGqh&wallet=list">
                              <i class="align-text-bottom fa-solid fa-wallet"></i>
                              Wallets
                            </a>
                          </li>
                          <li class="nav-item">
                            <a class="nav-link active" href="/cgi-bin/repo?id=z6ysXz6ubEakbjB6aJfE7AXLxGqh&source=.">
                              <i class="align-text-bottom fa-solid fa-code"></i>
                              Source
                            </a>
                          </li>
                        </ul>
                        <ul class="nav flex-column mb-2">
                        
                        </ul>
                      </div>
                    </nav>
                <main class="col-md-9 ms-sm-auto col-lg-10">
                  <div class="container px-1 py-3">
        
<div class="mb-2" style="font-size:1.1rem;"><a href="/cgi-bin/repo?id=z6ysXz6ubEakbjB6aJfE7AXLxGqh&source=.">/</a> <a href="/cgi-bin/repo?id=z6ysXz6ubEakbjB6aJfE7AXLxGqh&source=examples">examples</a> / <a href="/cgi-bin/repo?id=z6ysXz6ubEakbjB6aJfE7AXLxGqh&source=examples%2Fimatrix">imatrix</a> / README.md</div>
        <div class="list-group">
        <div class="list-group-item">
        <div class="mb-2" style="font-weight:bold;"><i class="fa-solid fa-file"></i> README.md</div>
        <pre style="margin:0; font-size:0.85rem; overflow-x:auto; color:#fafafa;"><span style="color:#666; user-select:none;"> 1</span>  # llama.cpp/examples/imatrix
<span style="color:#666; user-select:none;"> 2</span>  
<span style="color:#666; user-select:none;"> 3</span>  Compute an importance matrix for a model from a given text dataset. The matrix can be used during quantization to enhance the quality of the quantized models.
<span style="color:#666; user-select:none;"> 4</span>  More information is available here: https://github.com/ggerganov/llama.cpp/pull/4861
<span style="color:#666; user-select:none;"> 5</span>  
<span style="color:#666; user-select:none;"> 6</span>  ## Usage
<span style="color:#666; user-select:none;"> 7</span>  
<span style="color:#666; user-select:none;"> 8</span>  ```
<span style="color:#666; user-select:none;"> 9</span>  ./llama-imatrix \
<span style="color:#666; user-select:none;">10</span>      -m model.gguf -f some-text.txt [-o imatrix.dat] [--process-output] [--verbosity 1] \
<span style="color:#666; user-select:none;">11</span>      [--no-ppl] [--chunk 123] [--output-frequency 10] [--save-frequency 0] \
<span style="color:#666; user-select:none;">12</span>      [--in-file imatrix-prev-0.dat --in-file imatrix-prev-1.dat ...]
<span style="color:#666; user-select:none;">13</span>  ```
<span style="color:#666; user-select:none;">14</span>  
<span style="color:#666; user-select:none;">15</span>  Here `-m` with a model name and `-f` with a file containing training data (e.g. `wiki.train.raw`) are mandatory.
<span style="color:#666; user-select:none;">16</span>  The parameters in square brackets are optional and have the following meaning:
<span style="color:#666; user-select:none;">17</span>  * `-o` (or `--output-file`) specifies the name of the file where the computed data will be stored. If omitted, `imatrix.dat` is used.
<span style="color:#666; user-select:none;">18</span>  * `--verbosity` specifies the verbosity level. If set to `0`, no output other than the perplexity of the processed chunks will be generated. If set to `1`, each time the results are saved a message is written to `stderr`. If `&gt;=2`, a message is output each time data is collected for any tensor. Default verbosity level is `1`.
<span style="color:#666; user-select:none;">19</span>  * `--output-frequency` specifies how often the result computed so far is saved to disk. Default is 10 (i.e., every 10 chunks).
<span style="color:#666; user-select:none;">20</span>  * `--save-frequency` specifies how often to save a copy of the imatrix in a separate file. Default is 0 (i.e., never).
<span style="color:#666; user-select:none;">21</span>  * `--process-output` specifies if data will be collected for the `output.weight` tensor. My experience is that it is better not to use the importance matrix when quantizing `output.weight`, so this is set to `false` by default.
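
As a rough illustration of how the optional flags above combine, the following dry-run sketch assembles a command line and prints it instead of running it. The file names are placeholders, and the chosen values (`2`, `20`, `100`, `99`) are arbitrary assumptions rather than recommendations.

```bash
# Dry-run sketch: build a llama-imatrix command line and print it.
# All file names are hypothetical placeholders.
model="ggml-model-f16.gguf"
data="train-data.txt"
out="imatrix.dat"

# -o                  output file (imatrix.dat is also the default)
# --verbosity 2       log each time data is collected for a tensor
# --output-frequency  save the running result every 20 chunks
# --save-frequency    keep a separate copy every 100 chunks
# -ngl                offload layers to the GPU
cmd=(./llama-imatrix -m "$model" -f "$data" -o "$out"
     --verbosity 2 --output-frequency 20 --save-frequency 100 -ngl 99)

# Print the assembled command on one line.
printf '%s\n' "${cmd[*]}"
```

To actually run the tool, replace the `printf` line with `"${cmd[@]}"`.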
<span style="color:#666; user-select:none;">22</span>  
<span style="color:#666; user-select:none;">23</span>  For faster computation, make sure to use GPU offloading via the `-ngl` argument.
<span style="color:#666; user-select:none;">24</span>  
<span style="color:#666; user-select:none;">25</span>  ## Example
<span style="color:#666; user-select:none;">26</span>  
<span style="color:#666; user-select:none;">27</span>  ```bash
<span style="color:#666; user-select:none;">28</span>  LLAMA_CUDA=1 make -j
<span style="color:#666; user-select:none;">29</span>  
<span style="color:#666; user-select:none;">30</span>  # generate importance matrix (imatrix.dat)
<span style="color:#666; user-select:none;">31</span>  ./llama-imatrix -m ggml-model-f16.gguf -f train-data.txt -ngl 99
<span style="color:#666; user-select:none;">32</span>  
<span style="color:#666; user-select:none;">33</span>  # use the imatrix to perform a Q4_K_M quantization
<span style="color:#666; user-select:none;">34</span>  ./llama-quantize --imatrix imatrix.dat ggml-model-f16.gguf ./ggml-model-q4_k_m.gguf q4_k_m
<span style="color:#666; user-select:none;">35</span>  ```
</pre>
        </div>
        </div>

</div>
</main>
</div>
</div>


</body>
</html>

