Releases · roboflow/inference

20 May 12:49

v0.11.0

a73803b

v0.11.0

🚀 Added

🎉 PaliGemma in `inference`! 🎉

You've probably heard about new PaliGemma model, right? We have it supported in new release of inference thanks to @probicheaux.

To run the model, you need to build and inference server your GPU machine using the following commands:

# clone the inference repo
git clone https://github.com/roboflow/inference.git

# navigate into repository root
cd inference

# build inference server with PaliGemma dependencies
docker build -t roboflow/roboflow-inference-server-paligemma -f docker/dockerfiles/Dockerfile.paligemma .

 # run server
docker run -p 9001:9001 roboflow/roboflow-inference-server-paligemma

👉 To prompt the model visit our examples 📖 or use the following code snippet:

import base64
import requests
import os

PORT = 9001
API_KEY = os.environ["ROBOFLOW_API_KEY"]
IMAGE_PATH = "<PATH-TO-YOUR>/image.jpg"

def encode_bas64(image_path: str):
    with open(image_path, "rb") as image:
        x = image.read()
        image_string = base64.b64encode(x)
    return image_string.decode("ascii")

def do_gemma_request(image_path: str, prompt: str):
    infer_payload = {
        "image": {
            "type": "base64",
            "value": encode_bas64(image_path),
        },
        "api_key": API_KEY,
        "prompt": prompt
    }
    response = requests.post(
        f'http://localhost:{PORT}/llm/paligemma',
        json=infer_payload,
    )
    return response.json()


print(do_gemma_request(
    image_path=IMAGE_PATH, 
    prompt="Describe the image"
))

🌱 Changed

documentations updates:
- document source_id parameter of VideoFrame by @sberan in #395
- fix workflows specification URL and other docs updates by @SolomonLake in #398
- add link to Roboflow licensing by @capjamesg in #403

🔨 Fixed

Bug introduced into InferencePipeline.init_with_workflow(...) in v0.10.0 causing import errors yielding misleading error message informing about broken dependencies:

inference.core.exceptions.CannotInitialiseModelError: Could not initialise workflow processing due to lack of dependencies required. Please provide an issue report under https://github.com/roboflow/inference/issues

Fixed with this PR #407

Full Changelog: v0.10.0...v0.11.0

Contributors

sberan, SolomonLake, and 2 other contributors

Assets 2

14 May 12:25

PawelPeczek-Roboflow

v0.10.0

65103da

v0.10.0

🚀 Added

🎊 Core modules of `workflows` are `Apache-2.0` now

We're excited to announce that the core of workflows is now open-source under the Apache-2.0 license! We invite the community to explore the workflows ecosystem and contribute to its growth. We have plenty of ideas for improvements and would love to hear your feedback.

Feel free to check out our examples and docs 📖 .

🏗️ Roboflow `workflows` are changing before our eyes

We've undergone a major refactor of the workflows Execution Engine to make it more robust:

blocks can now be stand-alone modules - what makes them separated from Execution Engine
bocks now expose OpenAPI manifests for automatic parsing and validation
custom plugins with blocks can be created, installed via pip, and integrated with our core library blocks.

Thanks to @SkalskiP and @stellasphere we've made the documentation much better. Relying on new blocks self-describing capabilities we can now automatically generate workflows docs - you can now see exactly how to connect different blocks and how JSON definitions should look like.

Visit our docs 📖 to discover more

❗ There are minor breaking changes in manifests of some steps (DetectionsFilter, DetectionsConsensus, ActiveLearningDataCollector) as we needed to fix shortcuts made in initial version. Migration would require plugging output of another step into fields image_metadata, prediction_type of mentioned blocks.

🔧 `inference --version`

Thanks to @Griffin-Sullivan we have now a new command in inference-cli available to show details on what version of inference* packages are installed.

inference --version

🌱 Changed

Huge general docs upgrade by @LinasKo (#385, #378, #372) fixing broken links, general structure and aliases for keypoints coco-models

🔨 Fixed

Inconsistency in builds due to release of fastapi package by @grzegorz-roboflow #374
Middleware error in inference server - making every response not getting HTTP 2xx into HTTP 500 😢 - introduced in v0.9.23 - thanks @probicheaux for taking the effort to fix it
bug that was present in post-processing of all instance-segmentation models making batch inference faulty when some image yields zero predictions - huge kudos to @grzegorz-roboflow for spotting the problem and fixing it.

🏅 New Contributors

@Griffin-Sullivan made their first contribution in #339

Full Changelog: v0.9.23...v0.10.0

Contributors

LinasKo, probicheaux, and 4 other contributors

Assets 2

30 Apr 17:37

skylargivens

v0.9.23

c8a5da7

v0.9.23

What's Changed

Improve benchmark output; fix exception handling by @grzegorz-roboflow in #354
Minor docs update, API key in InferenceHTTPClient by @LinasKo in #357
Add api key fallback for model monitoring by @hansent in #366
Downgrade transformers to avoid faulty release of that package by @PawelPeczek-Roboflow in #363
Upped skypilot version by @bigbitbus in #367
Lock Grounding DINO package version to 0.2.0 by @skylargivens in #368

New Contributors

@LinasKo made their first contribution in #357

Full Changelog: v0.9.22...v0.9.23

Contributors

skylargivens, hansent, and 4 other contributors

Assets 2

18 Apr 16:01

PacificDou

v0.9.22

f78f5a2

v0.9.22

What's Changed

Add new endpoints for workflows and prepare for future deprecation by @PawelPeczek-Roboflow in #336
Update description for workflows steps by @grzegorz-roboflow in #345
Add error status code to benchmark output by @grzegorz-roboflow in #351
Add more test cases to cover tests/inference/unit_tests/core/utils/test_postprocess.py::post_process_polygons by @grzegorz-roboflow in #352
Inference TensorRT execution provider container revival by @probicheaux in #347
Bugfix for gaze detection (batch request) by @PacificDou in #358
Allow alternate video sources by @sberan in #348
Skip encode image as jpeg if no-resize is specified by @PacificDou in #359

New Contributors

@grzegorz-roboflow made their first contribution in #345

Full Changelog: v0.9.20...v0.9.22

Contributors

sberan, PacificDou, and 3 other contributors

Assets 2

27 Mar 16:48

probicheaux

v0.9.20

9cca25b

v0.9.20

What's Changed

Bump version for pypi wheels

Full Changelog: v0.9.19...v0.9.20

Assets 2

27 Mar 16:07

probicheaux

v0.9.19

56458cd

v0.9.19

GroundingDINO bugfixes and enhancements!

Allows users to pass custom box_threshold and text_threshold params to Grounding DINO core model.
Update docs to reflect box_threshold and text_threshold params.
Fixes error by filtering out detections where text similarity is lower than text_threshold and Grounding DINO returns None for class ID.
Fixes images passed to Grounding DINO model being loaded as RBG instead of BGR.
Adds NMS to Grounding DINO, optionally using class agnostic NMS via CLASS_AGNOSTIC_NMS env var.

Try it out:

from inference.models.grounding_dino import GroundingDINO

model = GroundingDINO(api_key="")

results = model.infer(
    {
        "image": {
            "type": "url",
            "value": "https://media.roboflow.com/fruit.png",
        },
        "text": ["apple"],

        # Optional params
        "box_threshold": 0.5
        "text_threshold": 0.5
    }
)

print(results.json())

Full Changelog: v0.9.18...v0.9.19

Assets 2

25 Mar 12:08

PawelPeczek-Roboflow

v0.9.18

2185ed7

v0.9.18

🚀 Added

🎥 🎥 Multiple video sources 🤝 `InferencePipeline`

Previous versions of the InferencePipeline could only support a single video source. However, from now on, you can pass multiple videos into a single pipeline and have all of them processed! Here is a demo:

demo_short.mp4

Here's how to achieve the result:

from inference import InferencePipeline
from inference.core.interfaces.stream.sinks import render_boxes

pipeline = InferencePipeline.init(
    video_reference=["your_video.mp4", "your_other_ideo.mp4"],
    model_id="yolov8n-640",
    on_prediction=render_boxes,
)
pipeline.start()
pipeline.join()

There were a lot of internal changes made, but the majority of users should not experience any breaking changes. Please visit our 📖 documentation to discover all the differences. If you are affected by the changes we needed to introduce, here is the 🔧 migration guide.

Barcode detector in `workflows`

Thanks to @chandlersupple, we have ability to detect and read barcodes in workflows.

Visit our 📖 documentation to see how to bring this step into your workflow.

🌱 Changed

Easier data collection in `inference` 🔥

We've introduced a new parameter handled by the inference server (including hosted inference at Roboflow platform). This parameter, called active_learning_target_dataset, can now be added to requests to specify the Roboflow project where collected data should be stored.

Thanks to this change, you can now collect datasets while using Universe models. We've also updated Active Learning 📖 docs

from inference_sdk import InferenceHTTPClient, InferenceConfiguration

# prepare and set configuration
configuration = InferenceConfiguration(
    active_learning_target_dataset="my_dataset",
)
client = InferenceHTTPClient(
    api_url="https://detect.roboflow.com",
    api_key="<YOUR_ROBOFLOW_API_KEY>",
).configure(configuration)

# run normal request and have your data sampled 🤯 
client.infer(
    "./path_to/your_image.jpg",
    model_id="yolov8n-640",
)

Other changes

Add inference_id to batches created by AL by @robiscoding in #319
Improvements in 📖 documentation regarding RGB vs BGR topic by @probicheaux in #330

🔨 Fixed

Thanks to contribution of @hvaria 🏅 we have two problems solved:

Ensure Graceful Interruption of Benchmark Process - Fixing for Bug #313: in #325
Better error handling in inference CLI: in #328

New Contributors

@chandlersupple made their first contribution in #311

Full Changelog: v0.9.17...v0.9.18

Contributors

probicheaux, chandlersupple, and 2 other contributors

Assets 2

15 Mar 13:50

PawelPeczek-Roboflow

v0.9.17

a5dc38a

v0.9.17

🚀 Added

YOLOWorld - new versions and Roboflow hosted inference 🤯

inference package now support 5 new versions of YOLOWorld model. We've added versions x, v2-s, v2-m, v2-l, v2-x. Versions with prefix v2 have better performance than the previously published ones.

To use YOLOWorld in inference, use the following model_id: yolo_world/<version>, substituting <version> with one of [s, m, l, x, v2-s, v2-m, v2-l, v2-x].

You can use the models in different contexts:

Roboflow hosted `inference` - easiest way to get your predictions 💥

💡 Please make sure you have inference-sdk installed

If you do not have the whole inference package installed, you will need to install at leastinference-sdk:

pip install inference-sdk

💡 You need Roboflow account to use our hosted platform

import cv2
from inference_sdk import InferenceHTTPClient

client = InferenceHTTPClient(api_url="https://infer.roboflow.com", api_key="<YOUR_ROBOFLOW_API_KEY>")
image = cv2.imread("<path_to_your_image>")
results = client.infer_from_yolo_world(
    image,
    ["person", "backpack", "dog", "eye", "nose", "ear", "tongue"],
    model_version="s",  # <-- you do not need to provide `yolo_world/` prefix here
)

Self-hosted `inference` server

💡 Please remember to clean up old version of docker image

If you ever used inference server before, please run:

docker rmi roboflow/roboflow-inference-server-cpu:latest

# or, if you have GPU on the machine
docker rmi roboflow/roboflow-inference-server-gpu:latest

in order to make sure the newest version of image is pulled.

💡 Please make sure you run the server and have sdk installed

If you do not have the whole inference package installed, you will need to install at least inference-cli and inference-sdk:

pip install inference-sdk inference-cli

Make sure you start local instance of inference server before running the code

inference server start

import cv2
from inference_sdk import InferenceHTTPClient

client = InferenceHTTPClient(api_url="http://127.0.0.1:9001")
image = cv2.imread("<path_to_your_image>")
results = client.infer_from_yolo_world(
    image,
    ["person", "backpack", "dog", "eye", "nose", "ear", "tongue"],
    model_version="s",  # <-- you do not need to provide `yolo_world/` prefix here
)

In `inference` Python package

💡 Please remember to install inference with yolo-world extras

pip install "inference[yolo-world]"

import cv2
from inference.models import YOLOWorld

image = cv2.imread("<path_to_your_image>")
model = YOLOWorld(model_id="yolo_world/s")
results = model.infer(
    image, 
    ["person", "backpack", "dog", "eye", "nose", "ear", "tongue"]
)

🌱 Changed

Track source for remote execution flows by @tonylampada in #320
Improved documentation by @capjamesg in #321

New Contributors

@tonylampada made their first contribution in #320 🥇

Full Changelog: v0.9.16...v0.9.17

Contributors

tonylampada and capjamesg

Assets 2

11 Mar 15:48

PawelPeczek-Roboflow

v0.9.16

14fe2a9

v0.9.16

🚀 Added

🎬 `InferencePipeline` can now process the video using your custom logic

Prior to v0.9.16, InferencePipeline was only able to make inference against Roboflow models. Now - you can inject any arbitrary logic of your choice and process videos (files and streams) using custom function you create. Just look at the example:

import os
import json
from inference.core.interfaces.camera.entities import VideoFrame
from inference import InferencePipeline

TARGET_DIR = "./my_predictions"

class MyModel:

  def __init__(self, weights_path: str):
    self._model = your_model_loader(weights_path)

  def infer(self, video_frame: VideoFrame) -> dict:
    return self._model(video_frame.image)


def save_prediction(prediction: dict, video_frame: VideoFrame) -> None:
  with open(os.path.join(TARGET_DIR, f"{video_frame.frame_id}.json")) as f:
    json.dump(prediction, f)

my_model = MyModel("./my_model.pt")

pipeline = InferencePipeline.init_with_custom_logic(
  video_reference="./my_video.mp4",
  on_video_frame=my_model.infer,   # <-- your custom video frame processing function
  on_prediction=save_prediction,  # <-- your custom sink for predictions
)

# start the pipeline
pipeline.start()
# wait for the pipeline to finish
pipeline.join()

That's not everything! Remember our workflows feature? We've just added workflows into InferencePipeline (in experimental mode). Check InferencePipeline.init_with_workflow(...) to test the feature.

❗ Breaking change: we've reverted changes introduced in v0.9.15 to InferencePipeline.init(...) making it compatible with YOLOWorld model. Now, you would need to use InferencePipeline.init_with_yolo_world(...) as shown here:

pipeline = InferencePipeline.init_with_yolo_world(
      video_reference="YOUR-VIDEO"
      on_prediction=...,
      classes=["person", "dog", "car", "truck"]
  )

We've updated 📖 docs to make it easy to use new feature.

Thanks @paulguerrie for great contribution

🌱 Changed

Huge changes in 📖 docs - thanks @capjamesg, @SkalskiP, @SolomonLake for contribution
Improved contributor experience by adding contributor guide and separating GHA CI, such that most important tests could work against repository fork
OpenVINO as default ONNX Execution Provider for x86 based docker images to improve speed of inference (@probicheaux )
Camera properties in InferencePipeline can be set now by caller (@sberan)

🔨 Fixed

added missing structlog dependency to package (@paulguerrie)
clarified models licence (@yeldarby)
bugs in lambda HTTP inference
fixed portion of security vulnerabilities
❗ breaking: Two exceptions (WorkspaceLoadError, MalformedWorkflowResponseError), when raised will be given HTTP502 error, instead of HTTP500 as previously
bug in workflows with class-filter at the level of detection-based model blocks not being applied.

New Contributors

@hansent made their first contribution in #293
@hvaria made their first contribution in #302

Full Changelog: v0.9.15...v0.9.16

Contributors

sberan, hansent, and 7 other contributors

Assets 2

28 Feb 15:29

paulguerrie

v0.9.15

4d1fd72

v0.9.15

What's Changed

YOLO-World Inference Pipeline by @paulguerrie in #282
QR code workflow step by @sberan in #286
Add structured API logger by @PawelPeczek-Roboflow in #287
Feature/yolov9 by @probicheaux in #290

Full Changelog: v0.9.14...v0.9.15

Contributors

sberan, probicheaux, and 2 other contributors

Assets 2

Releases: roboflow/inference

v0.11.0

🚀 Added

🎉 PaliGemma in inference! 🎉

🌱 Changed

🔨 Fixed

Contributors

v0.10.0

🚀 Added

🎊 Core modules of workflows are Apache-2.0 now

🏗️ Roboflow workflows are changing before our eyes

🔧 inference --version

🌱 Changed

🔨 Fixed

🏅 New Contributors

Contributors

v0.9.23

What's Changed

New Contributors

Contributors

v0.9.22

What's Changed

New Contributors

Contributors

v0.9.20

What's Changed

v0.9.19

v0.9.18

🚀 Added

🎥 🎥 Multiple video sources 🤝 InferencePipeline

Barcode detector in workflows

🌱 Changed

Easier data collection in inference 🔥

Other changes

🔨 Fixed

New Contributors

Contributors

v0.9.17

🚀 Added

YOLOWorld - new versions and Roboflow hosted inference 🤯

Roboflow hosted inference - easiest way to get your predictions 💥

Self-hosted inference server

In inference Python package

🌱 Changed

New Contributors

Contributors

v0.9.16

🚀 Added

🎬 InferencePipeline can now process the video using your custom logic

🌱 Changed

🔨 Fixed

New Contributors

Contributors

v0.9.15

What's Changed

Contributors

🎉 PaliGemma in `inference`! 🎉

🎊 Core modules of `workflows` are `Apache-2.0` now

🏗️ Roboflow `workflows` are changing before our eyes

🔧 `inference --version`

🎥 🎥 Multiple video sources 🤝 `InferencePipeline`

Barcode detector in `workflows`

Easier data collection in `inference` 🔥

Roboflow hosted `inference` - easiest way to get your predictions 💥

Self-hosted `inference` server

In `inference` Python package

🎬 `InferencePipeline` can now process the video using your custom logic