Background

vLLM currently supports various model features through configuration parameters, but lacks support for passing additional model-specific parameters through `extra_body`, which is particularly important for features such as structured output. See:
https://github.com/vllm-project/vllm/blob/v0.6.0/vllm/engine/arg_utils.py#L276
Current OpenAI implementation

The OpenAI Python client already accepts an `extra_body` argument whose keys are merged into the request body:

```python
completion = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Generate a user profile"}],
    extra_body={
        "guided_json": Test.schema_json(),  # JSON schema of a pydantic model
        "guided_decoding_backend": "lm-format-enforcer",
    },
)
```
Proposed implementation

```go
resp, err := integrations.LLMClient.Client.CreateChatCompletion(
	ctx,
	openai.ChatCompletionRequest{
		Model:     "...",
		Messages:  []openai.ChatCompletionMessage{ ... },
		ExtraBody: map[string]any{ ... },
	},
)
```
This can be useful for other engines as well, e.g. NVIDIA NIMs - https://build.nvidia.com/nvidia/nv-embedqa-e5-v5?snippet_tab=Python