-
Notifications
You must be signed in to change notification settings - Fork 189
Pull requests: openvinotoolkit/openvino.genai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[LLM/VLM] Stop generation when streaming callback returns true
category: continuous batching
Continuous batching
category: LLM
LLM pipeline (stateful, static)
StaticLLMPipeline: Cherry-pick num_key_value_heads not present in config.json
category: LLM
LLM pipeline (stateful, static)
category: NPU
[ SD ] Fix of scheduler config for main_pipeline
category: speculative decoding
Speculative decoding
no-match-files
add performance statistics for image generation
category: GenAI C++ API
Changes in GenAI C++ public headers
category: Python API
Python API for GenAI
category: samples
GenAI samples
category: text to image
Text 2 image pipeline
#1405
opened Dec 18, 2024 by
xufang-lisa
•
Draft
check optimum onnx fix
category: llm_bench
Label for tool/llm_bench folder
#1404
opened Dec 18, 2024 by
eaidova
Loading…
Add performance statistics for speculative decoding
category: continuous batching
Continuous batching
category: samples
GenAI samples
category: speculative decoding
Speculative decoding
#1403
opened Dec 18, 2024 by
xufang-lisa
•
Draft
Cross referencing blogs in genai samples readme
category: samples
GenAI samples
#1399
opened Dec 17, 2024 by
DimaPastushenkov
Loading…
Removed WAs for OpenVINO: pass properties as is
category: continuous batching
Continuous batching
category: GHA
CI based on Github actions
category: LLM
LLM pipeline (stateful, static)
category: Python API
Python API for GenAI
category: speculative decoding
Speculative decoding
category: text to image
Text 2 image pipeline
category: tokenizers
Tokenizer class or submodule update
category: visual language
Visual language pipeline
category: whisper
Whisper pipeline
no-match-files
[ CB ][ SD ] Implement streaming with using Continuous batching
category: LLM
LLM pipeline (stateful, static)
category: samples
GenAI samples
category: sampling
Sampling / Decoding algorithms
no-match-files
stop_strings
and include_stop_strings
via streamer & generation handling
category: continuous batching
Whisper pipeline: implement 'initial_prompt' and 'hotwords' parameters
category: GenAI C++ API
Changes in GenAI C++ public headers
category: Python API
Python API for GenAI
category: samples
GenAI samples
category: whisper
Whisper pipeline
no-match-files
[GHA] Samples tests
category: cmake / build
Cmake scripts
category: GHA
CI based on Github actions
no-match-files
[LLM Bench] Allow Image Generation Models to Run in BF16
category: llm_bench
Label for tool/llm_bench folder
[CB]Support 4-bit cache
category: continuous batching
Continuous batching
do_not_merge
no-match-files
#1366
opened Dec 12, 2024 by
zhangYiIntel
•
Draft
Dynamic KV cache allocation
category: continuous batching
Continuous batching
category: samples
GenAI samples
no-match-files
[LLM Bench] Defining Framework in Torch Compile Benchmarking
category: llm_bench
Label for tool/llm_bench folder
[WIP] LoRA for FLUX
category: GenAI C++ API
Changes in GenAI C++ public headers
category: text to image
Text 2 image pipeline
Drop check of 'import openvino'
category: cmake / build
Cmake scripts
category: Python API
Python API for GenAI
#1299
opened Dec 4, 2024 by
ilya-lavrenov
•
Draft
Add slice before matmut transformation for CB scenario
category: continuous batching
Continuous batching
category: LLM
LLM pipeline (stateful, static)
category: sampling
Sampling / Decoding algorithms
category: speculative decoding
Speculative decoding
no-match-files
[VLM] Image resize model
category: GHA
CI based on Github actions
category: tokenizers
Tokenizer class or submodule update
category: visual language
Visual language pipeline
Parallel sampling with threadpool
category: continuous batching
Continuous batching
category: sampling
Sampling / Decoding algorithms
no-match-files
Previous Next
ProTip!
Follow long discussions with comments:>50.