An Introduction to OpenAI Function Calling | by David Hundley

[ad_1]

No extra unstructured information outputs; flip ChatGPT’s completions into structured JSON!

A number of months in the past, OpenAI launched their API to most of the people, which excited many builders who wished to utilize ChatGPT’s outputs in a scientific means. As thrilling has this has been, it’s equally been a little bit of a nightmare since we programmers are likely to work within the realm of structured information sorts. We like integers, booleans, and lists. The unstructured string might be unwieldy to cope with, and with a view to get constant outcomes, a programmer is required to face their worst nightmare: growing a daily expression (Regex) for correct parsing. 🤢

In fact, immediate engineering can truly assist fairly a bit right here, however it’s nonetheless not good. For instance, if you wish to have ChatGPT analyze the sentiment of a film evaluation for positivity or negativity, you would possibly construction a immediate that appears like this:

immediate = f'''
Please carry out a sentiment evaluation on the next film evaluation:
{MOVIE_REVIEW_TEXT}
Please output your response as a single phrase: both "Constructive" or "Unfavorable".
'''

This immediate truthfully does fairly decently, however the outcomes aren’t exactly constant. For instance, I’ve seen ChatGPT produce outputs that appear to be the next by way of the film sentiment instance:

Constructive
constructive
Constructive.

This won’t look like an enormous deal, however on this planet of programming, these are NOT equal. Once more, you will get round an easier instance like this with a little bit of Regex, however past the truth that most individuals (together with myself) are horrible at writing common expressions, there are merely some cases the place even Regex can’t parse the knowledge appropriately.

As you possibly can inform, programmers have been hoping that OpenAI would add performance to help structured JSON outputs, and OpenAI has delivered within the type of perform calling. Function calling is precisely because it sounds: it permits ChatGPT to supply arguments that may work together with a customized perform in a way that makes use of structured information sorts. Yup, no extra fancy immediate engineering and Regex to cross your fingers and hope you get the fitting final result. On this put up, we’ll cowl tips on how to make use of this new performance, however first, let’s begin with an instance of how we used to try to supply structured information outputs with immediate engineering and Regex.

Earlier than we bounce into the majority of our put up, please permit me to share a link to this Jupyter notebook in my GitHub. This pocket book incorporates all of the code I can be working (and extra) as a part of this weblog put up. Moreover, I might encourage you to check out OpenAI’s official function calling documentation for something that I could not cowl right here.

To exhibit what we used to do within the “pre-function calling days”, I wrote a small little bit of textual content about myself, and we’ll be utilizing the OpenAPI to extract bits of knowledge from this textual content. Right here is the “About Me” textual content we’ll be working with:

Hiya! My identify is David Hundley. I’m a principal machine studying engineer at State Farm. I take pleasure in studying about AI and instructing what I be taught again to others. I’ve two daughters. I drive a Tesla Mannequin 3, and my favourite online game sequence is The Legend of Zelda.

Let’s say I wish to extract the next bits of knowledge from that textual content:

Identify
Job title
Firm
Variety of kids as an integer (That is necessary!)
Automobile make
Automobile mannequin
Favourite online game sequence

Right here’s how I might engineer a few-shot immediate with a view to produce a structured JSON output:

# Engineering a immediate to extract as a lot data from "About Me" as a JSON object
about_me_prompt = f'''
Please extract data as a JSON object. Please search for the next items of knowledge.
Identify
Job title
Firm
Variety of kids as a single integer
Automobile make
Automobile mannequin
Favourite online game sequenceThat is the physique of textual content to extract the knowledge from:
{about_me}
'''
# Getting the response again from ChatGPT (gpt-3.5-turbo)
openai_response = openai.ChatCompletion.create(
mannequin = 'gpt-3.5-turbo',
messages = [{'role': 'user', 'content': about_me_prompt}]
)
# Loading the response as a JSON object
json_response = json.masses(openai_response['choices'][0]['message']['content'])
json_response

Let’s take a look at how ChatGPT returned this completion to me:

The “Pre-Operate Calling” Days (Captured by the writer)

As you possibly can see, this truly isn’t unhealthy. But it surely’s not splendid and will show to be dangerous for the next causes:

We’re not assured that OpenAI’s response will present a clear JSON output. It may have produced one thing like “Right here is your JSON:” adopted by the JSON output, that means that with a view to use json.masses() to parse the string right into a JSON object, we’d first need to strip out that little little bit of textual content that opens the response.
We’re not assured that the keys within the key-value pairs of the JSON object can be constant from API name to API name. Recall the instance from above of the three cases of the phrase Constructive. That is exactly the identical danger you run making an attempt to have ChatGPT parse out keys via few-shot immediate engineering. The one means you would possibly lock this down is with Regex, which comes with its personal baggage as we already mentioned.
We’re not assured to obtain our responses within the correct information kind format. Whereas our immediate engineering to extract variety of kids did parse into a correct integer, we’re on the mercy of crossing our fingers and hoping we get that constant outcome for each API name.

We may summarize these points right into a single assertion: With out perform calling, we aren’t assured to get constant outcomes which might be necessary for the precision required for systematic implementation. It’s a nontrivial subject that may be very difficult to treatment via immediate engineering and common expressions.

Now that we’ve constructed an instinct round why getting structured outputs from ChatGPT was previously problematic, let’s transfer into wanting on the new perform calling functionality launched by OpenAI.

Operate calling is definitely a little bit of a misnomer. OpenAI isn’t truly working your code in a real perform name. Reasonably, it’s merely establishing the structured arguments you’d have to execute your individual customized features, and I’d argue that is most well-liked conduct. When you could be pondering that it doesn’t make sense that the OpenAI API isn’t executing your customized perform, contemplate that with a view to do this, you’d need to go that perform code into ChatGPT. This perform code most likely incorporates proprietary data that you’d NOT wish to expose to anyone, therefore why it’s good that you just don’t truly need to go this code to utilize OpenAI’s perform calling.

Let’s bounce into an instance of tips on how to allow perform calling with a single customized perform. Utilizing our “About Me” pattern textual content from the earlier part, let’s create a customized perform referred to as extract_person_info. This perform wants simply three bits of knowledge: individual identify, job title, and variety of kids. (We’ll revisit extracting the remainder of the knowledge within the subsequent part; I simply wish to begin less complicated for now.) This tradition perform is deliberately quite simple and can merely take our arguments and print them collectively in a single string. Right here’s the code for this:

def extract_person_info(identify, job_title, num_children):
'''
Prints primary "About Me" dataInputs:
- identify (str): Identify of the individual
- job_title (str): Job title of the individual
- num_chilren (int): The variety of kids the dad or mum has.
'''
print(f'This individual's identify is {identify}. Their job title is {job_title}, they usually have {num_children} kids.')

With a purpose to make use of perform calling, we have to arrange a JSON object in a selected means that notes the identify of our customized perform and what information components we hope ChatGPT will extract from the physique of the textual content. Due to the specificity on how this JSON object ought to look, I might encourage you reference OpenAI’s developer documentation if you wish to know any particulars that I don’t cowl right here.

(Be aware: Within the OpenAI documentation, I observed one aspect within the JSON object referred to as required that seemingly signifies {that a} parameter should be current for ChatGPT to correctly acknowledge the perform. I attempted testing this out, and both this isn’t how this performance works or I did one thing unsuitable. Both means, I transparently don’t know what this required parameter signifies. 😅)

Right here is how we have to construction our JSON object to utilize our customized perform:

my_custom_functions = [
{
'name': 'extract_person_info',
'description': 'Get "About Me" information from the body of the input text',
'parameters': {
'type': 'object',
'properties': {
'name': {
'type': 'string',
'description': 'Name of the person'
},
'job_title': {
'type': 'string',
'description': 'Job title of the person'
},
'num_children': {
'type': 'integer',
'description': 'Number of children the person is a parent to'
}
}
}
}
]

You’re most likely already acquainted with JSON syntax, though let me draw consideration for a second to the information kind related to every property. If you’re a Python developer like myself, bear in mind that the information typing for this JSON construction is NOT straight equal to how we outline information constructions in Python. Usually talking, we are able to discover equivalencies that work out alright, however if you wish to know extra concerning the particular information sorts related to this JSON construction, check out this documentation.

Now we’re able to make our API name to get the outcomes! Utilizing the Python consumer, you’ll discover the syntax is similar to how we acquire completions normally. We’re simply going so as to add some extra arguments into this name that symbolize our perform calling:

# Getting the response again from ChatGPT (gpt-3.5-turbo)
openai_response = openai.ChatCompletion.create(
mannequin = 'gpt-3.5-turbo',
messages = [{'role': 'user', 'content': about_me}],
features = my_custom_functions,
function_call = 'auto'
)print(openai_response)

As you possibly can see, we merely go in our record of customized features (or in our case for now, our singular customized perform) because the features parameter, and also you’ll additionally discover a further parameter referred to as function_call that we’ve set to auto. Don’t fear about this for now as we’ll revisit what this auto piece is doing within the subsequent part.

Let’s run this code and try the total API response from ChatGPT

Operate calling with a single perform (Captured by the writer)

For probably the most half, this response seems the identical as a non-function name response, however now there’s a further subject within the response referred to as function_call, and nested beneath this dictionary are two extra gadgets: identify and arguments. identify signifies the identify of our customized perform that we are going to be calling with ChatGPT’s output, and arguments incorporates a string that we are able to load utilizing json.masses() to load our customized perform arguments as a JSON object.

Discover now that we’re getting rather more consistency than we have been in our pre-function calling methodology. Now we might be assured that the keys of the key-value pairs WILL be constant, and the information sorts WILL be constant. No want for fancy immediate engineering or common expressions!

That’s the core of OpenAI’s perform calling! In fact, this was a really simplistic instance to get you going, however you most likely have extra questions. Let’s cowl these on this subsequent part.

The earlier part lined a quite simple instance of tips on how to allow perform calling, however for those who’re like me, you most likely have some extra questions past this level. Naturally, I can’t cowl all these questions, however I do wish to cowl two massive ones which might be barely extra superior than what we lined within the earlier part.

What if the immediate I submit doesn’t comprise the knowledge I wish to extract per my customized perform?

In our authentic instance, our customized perform sought to extract three very particular bits of knowledge, and we demonstrated that this labored efficiently by passing in my customized “About Me” textual content as a immediate. However you could be questioning, what occurs for those who go in some other immediate that doesn’t comprise that data?

Recall that we set a parameter in our API consumer name referred to as function_call that we set to auto. We’ll discover this even deeper within the subsequent subsection, however what this parameter is basically doing is telling ChatGPT to make use of its finest judgment in determining when to construction the output for considered one of our customized features.

So what occurs once we submit a immediate that doesn’t match any of our customized features? Merely put, it defaults to typical conduct as if perform calling doesn’t exist. Let’s check this out with an arbitrary immediate: “How tall is the Eiffel Tower?”

Operate calling however with a immediate that doesn’t match the perform (Captured by the writer)

As you possibly can see, we’re getting a typical “Completions” output although we handed in our customized perform. Naturally, this is smart since this arbitrary Eiffel Towel immediate incorporates not one of the particular data we’re on the lookout for.

What if I wish to go a number of customized features and a few of them have overlapping parameters?

Briefly, ChatGPT intelligently handles this with out a downside. The place we beforehand handed in a single customized perform as basically an inventory of Python dictionaries, we simply have to preserve including extra Python dictionaries to this similar record, every representing its personal distinct perform. Let’s add two new features: one referred to as extract_vehicle_info and one other referred to as extract_all_info. Right here’s what our adjusted syntax seems like:

# Defining a perform to extract solely car data
def extract_vehicle_info(vehicle_make, vehicle_model):
'''
Prints primary car dataInputs:
- vehicle_make (str): Make of the car
- vehicle_model (str): Mannequin of the car
'''
print(f'Car make: {vehicle_make}nVehicle mannequin: {vehicle_model}')
# Defining a perform to extract all data offered within the authentic "About Me" immediate
def extract_vehicle_info(identify, job_title, num_children, vehicle_make, vehicle_model, company_name, favorite_vg_series):
'''
Prints the total "About Me" data
Inputs:
- identify (str): Identify of the individual
- job_title (str): Job title of the individual
- num_chilren (int): The variety of kids the dad or mum has
- vehicle_make (str): Make of the car
- vehicle_model (str): Mannequin of the car
- company_name (str): Identify of the corporate the individual works for
- favorite_vg_series (str): Particular person's favourite online game sequence.
'''
print(f'''
This individual's identify is {identify}. Their job title is {job_title}, they usually have {num_children} kids.
They drive a {vehicle_make} {vehicle_model}.
They work for {company_name}.
Their favourite online game sequence is {favorite_vg_series}.
''')
# Defining how we would like ChatGPT to name our customized features
my_custom_functions = [
{
'name': 'extract_person_info',
'description': 'Get "About Me" information from the body of the input text',
'parameters': {
'type': 'object',
'properties': {
'name': {
'type': 'string',
'description': 'Name of the person'
},
'job_title': {
'type': 'string',
'description': 'Job title of the person'
},
'num_children': {
'type': 'integer',
'description': 'Number of children the person is a parent to'
}
}
}
},
{
'name': 'extract_car_info',
'description': 'Extract the make and model of the person's car',
'parameters': {
'type': 'object',
'properties': {
'vehicle_make': {
'type': 'string',
'description': 'Make of the person's vehicle'
},
'vehicle_model': {
'type': 'string',
'description': 'Model of the person's vehicle'
}
}
}
},
{
'name': 'extract_all_info',
'description': 'Extract all information about a person including their vehicle make and model',
'parameters': {
'type': 'object',
'properties': {
'name': {
'type': 'string',
'description': 'Name of the person'
},
'job_title': {
'type': 'string',
'description': 'Job title of the person'
},
'num_children': {
'type': 'integer',
'description': 'Number of children the person is a parent to'
},
'vehicle_make': {
'type': 'string',
'description': 'Make of the person's vehicle'
},
'vehicle_model': {
'type': 'string',
'description': 'Model of the person's vehicle'
},
'company_name': {
'type': 'string',
'description': 'Name of the company the person works for'
},
'favorite_vg_series': {
'type': 'string',
'description': 'Name of the person's favorite video game series'
}
}
}
}
]

Discover particularly how the extract_all_info covers a number of the similar parameters as our authentic extract_person_info perform, so how does ChatGPT know which one to pick out? Merely put, ChatGPT seems for the most effective match. If we go in a immediate that incorporates all of the arguments wanted for the extract_all_info perform, that’s the one it’ll choose. But when we simply go in a immediate that incorporates both simply easy details about me or a immediate about my car, it’ll leverage the respective features that do this. Let’s execute that in code right here with just a few samples:

Pattern 1: The unique “About Me” textual content. (See above.)
Pattern 2: “My identify is David Hundley. I’m a principal machine studying engineer, and I’ve two daughters.”
Pattern 3: “She drives a Kia Sportage.”

Pattern #1’s Outcomes (Captured by the writer)

Pattern #2’s Outcomes (Captured by the writer)

With every of the respective prompts, ChatGPT chosen the proper customized perform, and we are able to particularly notice that within the identify worth beneath function_call within the API’s response object. Along with this being a helpful option to determine which perform to make use of the arguments for, we are able to programmatically map our precise customized Python perform to this worth to run the proper code appropriately. If that doesn’t make sense, maybe taking a look at this in code would make this extra clear:

# Iterating over the three samples
for i, pattern in enumerate(samples):print(f'Pattern #{i + 1}'s outcomes:')
# Getting the response again from ChatGPT (gpt-3.5-turbo)
openai_response = openai.ChatCompletion.create(
mannequin = 'gpt-3.5-turbo',
messages = [{'role': 'user', 'content': sample}],
features = my_custom_functions,
function_call = 'auto'
)['choices'][0]['message']
# Checking to see {that a} perform name was invoked
if openai_response.get('function_call'):
# Checking to see which particular perform name was invoked
function_called = openai_response['function_call']['name']
# Extracting the arguments of the perform name
function_args = json.masses(openai_response['function_call']['arguments'])
# Invoking the correct features
if function_called == 'extract_person_info':
extract_person_info(*record(function_args.values()))
elif function_called == 'extract_vehicle_info':
extract_vehicle_info(*record(function_args.values()))
elif function_called == 'extract_all_info':
extract_all_info(*record(function_args.values()))

Last programmatic outcomes! (Captured by the writer)

**Beware one factor**: Within the spirit of full transparency, I needed to run that code there a number of instances to get it to supply like that. The difficulty is that as a result of the extract_person_info and extract_all_info are extra related in nature, ChatGPT saved complicated these for each other. I assume the lesson to be realized right here is that your features needs to be extracting distinct data. I additionally solely examined utilizing gpt-3.5-turbo, so it’s doable {that a} extra highly effective mannequin like GPT-4 may have dealt with that higher.

I hope you possibly can see now why perform calling might be so highly effective! In the case of constructing purposes that leverage Generative AI, this sort of perform calling is a godsend for programmers. By not having to fret a lot now concerning the output JSON construction, we are able to now focus our time on constructing out different elements of the applying. It’s an superior time to be working on this area 🥳

[ad_2]

Source link

An Introduction to OpenAI Function Calling | by David Hundley | Jul, 2023

Telexistence picks up $170M – The Robot Report

How ‘Indiana Jones and the Dial of Destiny’ De-Aged Harrison Ford

Editor

How 'Indiana Jones and the Dial of Destiny' De-Aged Harrison Ford

Leave a Reply Cancel reply

Browse by Category

Categories

Recommended