Support image input in the chat completion request by Youho99 · Pull Request #55 · lhenault/simpleAI

Youho99 · 2024-07-10T15:28:16Z

Tested with a single image

This pull request responds to issue #54

It allows you to take into account the architecture of the OpenAI API request with an image

Example on the OpenAI documentation:

curl https://api.openai.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -d '{
    "model": "gpt-4-turbo",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "What'\''s in this image?"
          },
          {
            "type": "image_url",
            "image_url": {
              "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
            }
          }
        ]
      }
    ],
    "max_tokens": 300
  }'

The code has not been prettyfied, so we need to review that

> Tested with a single image

lhenault · 2024-07-12T09:48:15Z

Thanks for your work, will happily review this once you think it's ready (and passing the pre-commit check). If you have a working example for VLM / image processing to share, that would be a nice addition to the existing ones.

Youho99 · 2024-07-15T14:38:03Z

Don't use grpcio and grpcio-tools 1.65.0 version (remised version)

I don't know how to modify it in the poetry requirements

Youho99 · 2024-07-16T08:42:47Z

I just modified the rules regarding the versions of grpcio and grpcio-tools in the toml, and I regenerated the poetry.lock

Since this is my first time doing this, I would like to request special attention on this.

Youho99 · 2024-07-16T08:43:24Z

I will provide an example of using my feature in a second step (in another PR I think)

Youho99 · 2024-07-16T08:44:24Z

@lhenault I think you can review this PR (and change the version accordingly) :)

lhenault · 2024-08-28T10:09:28Z

Hey @Youho99 !

I tried your changes the other day and encountered a few issues, but probably because of me. Thanks again for your PR and sorry for the delay, it's very much appreciated. 😌

Let me have another look soon (and if you have a working example for image inputs that might speed up things).

Youho99 · 2024-08-28T12:14:46Z

@lhenault

In the next few days I'll get back to it, and provide an example.

Let me know if you have any problems.

Youho99 · 2025-01-09T16:26:31Z

@lhenault Hello and happy new year!

After a fews days (lol), i have finally produce an example for the image support.

Well, this one is not in the format of the examples already present in the library. We can do this work later.

Here is the project:
https://github.com/Youho99/phi-3_5-vision-onnx-simpleai

Youho99 · 2025-03-23T14:01:34Z

@lhenault any update ?

lhenault · 2025-04-10T11:22:32Z

Hey sorry I somehow missed this and the previous update. I'll have a look at it soon. Thanks a lot for the submission!

Youho99 · 2025-10-04T14:29:30Z

@lhenault
Can you reviex this PR ?

ggiret-thinkdeep added 2 commits July 10, 2024 15:18

Support image input in the chat completion request

4f312ef

> Tested with a single image

Code matching for the Stream part

3b2ae8c

Youho99 marked this pull request as ready for review July 15, 2024 14:35

ggiret-thinkdeep added 2 commits July 15, 2024 14:59

pre-commit formatting

1cd286d

Update grpc versions

2249c2a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support image input in the chat completion request#55

Support image input in the chat completion request#55
Youho99 wants to merge 4 commits into
lhenault:mainfrom
Youho99:main

Youho99 commented Jul 10, 2024

Uh oh!

lhenault commented Jul 12, 2024

Uh oh!

Youho99 commented Jul 15, 2024

Uh oh!

Youho99 commented Jul 16, 2024

Uh oh!

Youho99 commented Jul 16, 2024

Uh oh!

Youho99 commented Jul 16, 2024

Uh oh!

lhenault commented Aug 28, 2024

Uh oh!

Youho99 commented Aug 28, 2024 •

edited

Loading

Uh oh!

Youho99 commented Jan 9, 2025

Uh oh!

Youho99 commented Mar 23, 2025

Uh oh!

lhenault commented Apr 10, 2025

Uh oh!

Youho99 commented Oct 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Youho99 commented Jul 10, 2024

Uh oh!

lhenault commented Jul 12, 2024

Uh oh!

Youho99 commented Jul 15, 2024

Uh oh!

Youho99 commented Jul 16, 2024

Uh oh!

Youho99 commented Jul 16, 2024

Uh oh!

Youho99 commented Jul 16, 2024

Uh oh!

lhenault commented Aug 28, 2024

Uh oh!

Youho99 commented Aug 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Youho99 commented Jan 9, 2025

Uh oh!

Youho99 commented Mar 23, 2025

Uh oh!

lhenault commented Apr 10, 2025

Uh oh!

Youho99 commented Oct 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Youho99 commented Aug 28, 2024 •

edited

Loading