AutoGen - Build Powerful AI Agents with ChatGPT/GPT-4

In this tutorial, we'll explore AutoGen¹, a Microsoft library that lets you create LLM applications with agents. These agents can communicate and help you solve complex tasks.

Join the AI BootCamp!

Ready to dive deep into the world of AI and Machine Learning? Join our BootCamp to transform your career with the latest skills and real-world project experience. LLMs, ML best practices, and more!

We'll begin with an introduction to AutoGen and its benefits. Then, we'll kick off with a basic example of building a single agent for analyzing stock price trends. Afterward, we'll delve into a more advanced demonstration, using four agents to construct a cryptocurrency indicator, drawing insights from historical prices and news.

In this part, we will be using Jupyter Notebook to run the code. If you prefer to follow along, you can find the notebook on GitHub: GitHub Repository (opens in a new tab)

What Makes AutoGen Cool?

Flexible Chats: AutoGen can make agents talk to each other to solve problems. This is much more powerful than just one LLM doing everything.
Customization: You can create agents to do specific tasks. You can choose which LLM to use, how people can join the conversation, and even bring in tools to help.
Human Input: AutoGen lets humans join the conversation, too. They can give their ideas and feedback to help the agents.

AutoGen is like having a bunch of smart friends who work together to get things done, and it's made with help from top-notch researchers.

Setup

You can install AutoGen with pip:

pip install -Uqqq pip --progress-bar off
pip install -qqq pyautogen==0.1.10 --progress-bar off

Let's add the required libraries:

import json
from getpass import getpass
 
import autogen
import pandas as pd
import requests
from autogen import AssistantAgent, UserProxyAgent
from IPython.display import Image

Next, you need to enter your API key for OpenAI (get yours from https://platform.openai.com/account/api-keys (opens in a new tab)):

OPENAI_API_KEY = getpass()

We'll get news and historical prices for Bitcoin and Ethereum from Alpha Vantage². You can get your API key from https://www.alphavantage.co/support/#api-key (opens in a new tab). Let's enter it:

ALPHA_VANTAGE_API_KEY = getpass()

You can also load the data for this tutorial (if you want to reproduce the results) from my Google Drive:

gdown 1YIw3kRmPmWPeVu-wJlG3uNtJ4YjBVDEq
gdown 1zBz5qC1TmweDOKsmFo2AXSjk8uI6jT7d
gdown 1BpCOBIz-3OVgW2s337n9lgxgL5z8G4mG

Simple Agent

Let's start with a simple example. We'll create an agent that can fetch historical prices for Bitcoin, Ethereum, Tesla stock and plot them.

First, we need to create a configuration for our agent:

gpt_config_list = [
    {
        "model": "gpt-4", # or use "gpt-3.5-turbo"
        "api_key": OPENAI_API_KEY,
    }
]
 
llm_config = {"config_list": gpt_config_list, "use_cache": False, "temperature": 0}

The important part is the selection of model and the API key. We'll use GPT-4 for this example. You can also use GPT-3.5 Turbo (ChatGPT). Let's create the agent:

assistant = AssistantAgent(
    name="assistant",
    llm_config=llm_config,
)

To start the conversation, we need another agent that will represent the user (you). We'll use the UserProxyAgent for this:

def is_termination_msg(data):
    has_content = "content" in data and data["content"] is not None
    return has_content and "TERMINATE" in data["content"]
 
user_proxy = UserProxyAgent(
    name="user_proxy",
    human_input_mode="NEVER",
    max_consecutive_auto_reply=10,
    is_termination_msg=is_termination_msg,
    code_execution_config={"work_dir": "coding"},
)

We'll use the is_termination_msg function to stop the conversation when the agent replies with TERMINATE anywhere in the text. The code_execution_config will tell the agent to save any assets (code, images etc) in the coding directory. Let's start the conversation:

user_proxy.initiate_chat(
    assistant,
    message="What is the current year? Compare the year-to-date gain for BTC, ETH and TESLA.",
)

The assistant goes through a couple of rounds of fixing code and replying with the results (check the Colab notebook for the complete output). Here's the final report:

assistant

The code has successfully fetched the year-to-date (YTD) gain for BTC (Bitcoin),
ETH (Ethereum), and TSLA (Tesla). Here are the results:
 
- The YTD gain for BTC (Bitcoin) is approximately 61.25%
- The YTD gain for ETH (Ethereum) is approximately 29.91%
- The YTD gain for TSLA (Tesla) is not available (NaN)
 
The NaN result for Tesla could be due to various reasons such as lack of data
for the specified period or issues with the data source. It's recommended to
check the data source or try a different one if necessary.
 
TERMINATE

Isn't that cool? For some reason, the Tesla stock price is unavailable. We can even ask the agent to plot the data for us:

user_proxy.send(
    recipient=assistant,
    message="Plot a chart of their price change YTD and save to price_ytd.png.",
)

Here's the final message from the assistant:

assistant

Great! The code has successfully fetched the data and created the plot. The plot
has been saved to a file named 'price_ytd.png'. You can find this file in the
current directory.
 
Please open the 'price_ytd.png' file to view the plot. It shows the price change
year-to-date (YTD) for BTC (Bitcoin), ETH (Ethereum), and TSLA (Tesla).
 
TERMINATE

Let's look at the plot:

The plot is not very useful because the price of Bitcoin is much higher than the price of Ethereum and Tesla stock. We can ask the assistant to scale the prices for us:

user_proxy.send(
    recipient=assistant,
    message="I can't see the ETH stock price on the chart",
)

assistant

Great! The code has successfully fetched the data and created the plot. The plot
has been saved to a file named 'price_ytd.png'. You can find this file in the
current directory.
 
Please open the 'price_ytd.png' file to view the plot. It shows the cumulative
percentage change in price year-to-date (YTD) for BTC (Bitcoin), ETH (Ethereum),
and TSLA (Tesla). This should make it easier to compare the performance of these
assets over the same period.
 
TERMINATE

Much better! We now even have the Tesla prices showing. Though, the dates are not properly showing. We can ask the assistant to fix that:

user_proxy.send(
    recipient=assistant,
    message="The dates overlap, increase the plot size",
)

assistant

Great! The code has successfully fetched the data and created the plot with a
larger size and rotated date labels. The plot has been saved to a file named
'price_ytd.png'. You can find this file in the current directory.
 
Please open the 'price_ytd.png' file to view the plot. It shows the cumulative
percentage change in price year-to-date (YTD) for BTC (Bitcoin), ETH (Ethereum),
and TSLA (Tesla). The dates on the x-axis should now be more readable.
 
TERMINATE

I am impressed. After a couple of rounds of fixing code, the assistant was able to produce a nice plot for us and save it to a file.

Build AI Agent Workforce for Cryptocurrency Analysis

Your first agent was very useful, but the AutoGen library shines when you have multiple agents working together. Let's build a cryptocurrency indicator that can analyze historical prices and news to give us recommended trading actions for Bitcoin and Ethereum.

Let's start with getting the latest news and prices for cryptocurrencies. We'll use the Alpha Vantage API to get the data:

def fetch_prices_for_symbol(symbol: str, days: int, use_cache: bool = False) -> str:
    if use_cache:
        if symbol == "BTC":
            price_df = pd.read_csv("btc_price.csv")
        else:
            price_df = pd.read_csv("eth_price.csv")
        return price_df[["date", "close", "volume"]].head(days)
 
    url = f"https://www.alphavantage.co/query?function=DIGITAL_CURRENCY_DAILY&symbol={symbol}&market=USD&apikey={ALPHA_VANTAGE_API_KEY}"
    r = requests.get(url)
    price_data = r.json()
 
    rows = []
    for key, day_data in price_data["Time Series (Digital Currency Daily)"].items():
        rows.append(
            {
                "date": key,
                "high": day_data["2a. high (USD)"],
                "low": day_data["3b. low (USD)"],
                "close": day_data["4a. close (USD)"],
                "volume": day_data["5. volume"],
            }
        )
    df = pd.DataFrame(rows)
    return df[["date", "close", "volume"]].head(days)

The fetch_prices_for_symbol function will fetch the latest prices for a given symbol (BTC or ETH). We can also use the cache to reproduce the results from this tutorial. Let's create a helper function that will fetch the prices for both cryptocurrencies and convert it to a string:

def fetch_prices(days: int) -> str:
    btc_df = fetch_prices_for_symbol("BTC", days, use_cache=True)
    eth_df = fetch_prices_for_symbol("ETH", days, use_cache=True)
 
    btc_txt = btc_df.to_string(index=None)
    eth_txt = eth_df.to_string(index=None)
 
    return f"""
$BTC prices (last {days} days)
{btc_txt}
 
$ETH prices (last {days} days)
{eth_txt}
    """.strip()

Next, we'll create a function that will fetch the latest news for the blockchain topic:

def fetch_news(latest_n: int) -> str:
    news_df = pd.read_csv("news.csv", sep=";")
    return news_df[["summary", "sentiment_score"]].head(latest_n).to_string(index=None)
    url = f"https://www.alphavantage.co/query?function=NEWS_SENTIMENT&sort=LATEST&topics=blockchain&apikey={ALPHA_VANTAGE_API_KEY}"
    r = requests.get(url)
    data = r.json()
 
    rows = []
    for item in data["feed"]:
        rows.append(
            {
                "title": item["title"],
                "summary": item["summary"],
                "time_published": item["time_published"],
                "sentiment_score": item["overall_sentiment_score"],
            }
        )
    df = pd.DataFrame(rows)
    return df[["summary", "sentiment_score"]].head(latest_n).to_string(index=None)

Again, this can use the cache to reproduce the results from this tutorial. Let's try out the fetch_prices function to fetch the 14 latest prices for Bitcoin and Ethereum:

prices_txt = fetch_prices(days=14)
print(prices_txt)

$BTC prices (last 14 days) date close volume 2023-10-11 27420.00 422.77304
2023-10-10 27390.12 22776.84383 2023-10-09 27590.12 31534.74639 2023-10-08
27917.05 19693.56921 2023-10-07 27956.67 13348.83825 2023-10-06 27931.09
37983.24277 2023-10-05 27410.39 30681.49619 2023-10-04 27778.57 29816.14200
2023-10-03 27426.46 28928.96555 2023-10-02 27494.51 57071.14241 2023-10-01
27992.57 24602.81468 2023-09-30 26962.56 12804.62307 2023-09-29 26906.96
28478.76219 2023-09-28 27021.39 44517.83491
 
$ETH prices (last 14 days) date close volume 2023-10-11 1569.76 3369.1026
2023-10-10 1567.63 240659.3287 2023-10-09 1580.13 340141.0988 2023-10-08 1632.84
149424.6635 2023-10-07 1633.57 101241.5732 2023-10-06 1645.03 220507.9286
2023-10-05 1611.79 237378.7449 2023-10-04 1646.58 222861.7167 2023-10-03 1656.88
198287.1164 2023-10-02 1662.40 378575.6898 2023-10-01 1733.79 225611.5868
2023-09-30 1670.89 136924.9141 2023-09-29 1667.45 251975.6728 2023-09-28 1652.99
331409.7015

We have our functions ready, but who and when will call them? Fortunatelly, we can tell the AutoGen agents to use them. Let's create a configuration:

llm_config = {
    "config_list": gpt_config_list,
    "use_cache": False,
    "temperature": 0,
    "functions": [
        {
            "name": "fetch_prices",
            "description": "Fetch daily prices for $BTC and $ETH",
            "parameters": {
                "type": "object",
                "properties": {
                    "days": {
                        "type": "integer",
                        "description": "number of historical days",
                    }
                },
                "required": ["days"],
            },
        },
        {
            "name": "fetch_news",
            "description": "Fetch the latest news about cryptocurrencies",
            "parameters": {
                "type": "object",
                "properties": {
                    "latest_n": {
                        "type": "integer",
                        "description": "number of latest news articles to get",
                    }
                },
                "required": ["latest_n"],
            },
        },
    ],
}

The configuration tells the agents that they can use the fetch_prices and fetch_news functions. Let's start with the first agent, the analyst:

analyst_system_message = """
Analyst. You are a senior financial analyst for a cryptocurrency indicator.
 
Follow the plan:
 - Get news and prices from the engineer
 - Give the news and prices (in compact format) to the bull and get their opinion
 - Give the news and prices (in compact format) to the bear and get their opinion
 - Write a report with BUY, SELL or HODL along with a number between 0 (extremely bearish)
and 100 (extremely bullish) based on the discussion with the bear,
the bull, and your own opinion for $ETH and $BTC.
 
Add TERMINATE to the end of the message.
"""
 
analyst = AssistantAgent(
    name="analyst",
    system_message=analyst_system_message,
    llm_config=llm_config,
    is_termination_msg=is_termination_msg,
    code_execution_config=False,
)

The analyst will make the final decision about the cryptocurrency indicator. They will get the news and prices from the engineer, then ask the bull and the bear for their opinion. Finally, they will write a report with a recommendation (BUY, SELL or HODL) and a score between 0 and 100. Let's create the engineer:

engineer_system_message = """
Engineer. You are a senior software engineer that executes the fetch_prices and
fetch_news functions as requested by the analyst.
"""
 
engineer = AssistantAgent(
    name="engineer",
    system_message=engineer_system_message,
    llm_config=llm_config,
    function_map={"fetch_prices": fetch_prices, "fetch_news": fetch_news},
    code_execution_config=False,
)

The engineer will be the only one to execute the fetch_prices and fetch_news functions. It knows how to call them based on the llm_config and the function_map. Note that the code_execution_config=False disallows calling other functions. Let's create the bull and the bear:

bull_system_message = """
Bull. You receive news and prices for $ETH and $BTC from the analyst.
You're always on the bullish side for cryptocurrencies and try to find sentiment that
supports explosive price increases in the short and long term.
You reply to the senior analyst with 1-2 sentences why the data suggests bullish times ahead.
"""
 
bull = AssistantAgent(
    name="bull",
    system_message=bull_system_message,
    code_execution_config=False,
    llm_config=llm_config,
)

The bull creates a short message that explains why the data suggests bullish times ahead and passes it back to the analyst. The bear searches for proof of bearish times ahead:

bear_system_message = """
Bear. You receive news and prices for $ETH and $BTC from the analyst.
You're always on the bearish side for cryptocurrencies and try to find sentiment that
supports explosive price drops in the short and long term.
You reply to the senior analyst with 1-2 sentences why the data suggests bearish times ahead.
"""
 
bear = AssistantAgent(
    name="bear",
    system_message=bear_system_message,
    code_execution_config=False,
    llm_config=llm_config,
)

The final agent is the user_proxy that will represent you. It will initiate the chat with the analyst and send the message that will start the conversation:

admin_system_message = """
A human admin. Interact with the analyst to discuss the cryptocurrency report.
The final report needs to be approved by this admin.
"""
 
user_proxy = UserProxyAgent(
    name="admin",
    system_message=admin_system_message,
    code_execution_config=False,
    human_input_mode="NEVER",
    is_termination_msg=is_termination_msg,
)

Note that of all agents, only the engineer can execute code. The rest of the agents can only send messages to each other. Let's create the group chat and the manager:

groupchat = autogen.GroupChat(
    agents=[user_proxy, analyst, engineer, bear, bull], messages=[], max_round=20
)
manager = autogen.GroupChatManager(groupchat=groupchat, llm_config=llm_config)

The max_round parameter tells the manager to stop the conversation after 20 rounds. We're ready to start the conversation. Let's pass our message to the manager:

message = """
Analyze $ETH and $BTC based on the latest 20 news articles and prices for the
last 14 days.
"""
user_proxy.initiate_chat(manager, clear_history=True, message=message)

The Analyst tells the Engineer to fetch the historical prices for the last 14 days. Here's the discussion after that:

Bear

The recent price data for both $BTC and $ETH shows a downward trend, indicating
bearish sentiment. The consistent decrease in prices over the past 14 days
suggests that investors are selling off their holdings, which could lead to
further price drops.

Bull

While it's true that there's been a slight decrease in prices over the past 14
days, this could be seen as a healthy correction in a broader bullish trend. The
high trading volumes for both $BTC and $ETH suggest strong market interest,
which could fuel a price surge in the near future.

The Analyst tells the Engineer to fetch the latest 20 news articles. Here's the discussion after that:

Bull

The latest news articles show a positive sentiment overall, with developments
like Circle's initiative to make USDC accessible on Polygon PoS, Immutable's
collaboration with Amazon Web Services, and Binance's commitment to invest in
promising crypto startups. These advancements suggest a bullish future for
cryptocurrencies as they gain more mainstream acceptance and infrastructure
support.

Bear

The latest news articles show a mix of sentiment, but there are several bearish
indicators. For instance, Bitfinex's parent company is considering a $150
million share repurchase amidst rising regulatory attention, suggesting
potential instability. Additionally, the collapse of crypto exchange FTX and the
halted cryptocurrency accounts associated with Hamas indicate potential risks
and volatility in the market. These factors could contribute to a bearish
outlook for cryptocurrencies.

Analyst

Based on the discussion with the bull and the bear, as well as my own analysis
of the price data and news sentiment, here is my report:
 
$BTC: The recent price trend is slightly bearish, but the overall news sentiment
is positive. The high trading volumes suggest strong market interest. However,
potential regulatory issues and market volatility could pose risks. My
recommendation is HODL with a score of 50, indicating a neutral outlook.
 
$ETH: Similar to $BTC, the price trend for $ETH is slightly bearish, but the
news sentiment is generally positive. The high trading volumes and developments
like the collaboration between Immutable and Amazon Web Services suggest a
bullish future. My recommendation is HODL with a score of 60, indicating a
slightly bullish outlook.
 
TERMINATE.

The conversation went well. The analyst was able to get the latest news and prices from the engineer, then ask the bull and the bear for their opinion. Finally, you got a report with a recommendation (HODL) for both cryptocurrencies and a related score. The analyst also gave a short explanation for their decision.

Conclusion

Honestly, I am impressed with the AutoGen library. It's a great way to build AI agents and solve tasks that a single prompt can't. I hope you enjoyed this tutorial and learned something new. If you want to learn more about AutoGen, check out the official documentation (opens in a new tab).

3,000+ people already joined

Join the The State of AI Newsletter

Every week, receive a curated collection of cutting-edge AI developments, practical tutorials, and analysis, empowering you to stay ahead in the rapidly evolving field of AI.

I won't send you any spam, ever!

AutoGen - Build Powerful AI Agents with ChatGPT/GPT-4

Join the AI BootCamp!

What Makes AutoGen Cool?

Setup

Simple Agent

Build AI Agent Workforce for Cryptocurrency Analysis

Conclusion

Join the The State of AI Newsletter

References

Footnotes