Build and monetize AI Agents on Apify
This guide shows you how to create an AI agent using the CrewAI Python framework and Apify platform. You will build an Instagram analysis agent that integrates with large language models (LLMs) and web scrapers.
AI agents are goal-oriented systems that make independent decisions. They use predefined tools to interact with their environment and automate complex tasks.
On Apify, AI agents are built as Actors—serverless cloud programs for web scraping, data processing, and AI deployment. Apify evolved from running scrapers that follow predefined workflows in the cloud to supporting LLM-powered agents that dynamically define their own workflow to reach a goal.
Prerequisites
To build an effective AI agent, you need:
- Prompts to guide the agent
- Tools for interacting with external systems
- A large language model (LLM) to do the reasoning
- An agentic framework to connect these components and manage the LLM's behavior
- A platform to run, deploy, and scale the solution
Benefits of using Apify for AI agents
Apify provides a complete platform for building and deploying AI agents with the following benefits:
- Serverless execution - without infrastructure management
- Stateful execution - with agent memory capabilities
- Monetization options - through usage-based charging
- Extensive tool ecosystem - with thousands of available Actors
- Scalability and reliability - for production environments
- Pre-integrated tools - for web scraping and automation
Building an AI agent
Step 1: Define the use case
This tutorial builds a social media analysis agent that analyzes Instagram posts based on user queries, using the [Instagram Scraper Actor](https://apify.com/apify/instagram-scraper).
Example:
- Input: "Analyze the last 10 posts from @openai and summarize AI trends."
- Output: Trend analysis based on post content.
Step 2: Configure input and output
Define the input format (URL, JSON configuration, or text query) and output format (text response or structured data) for your agent.
Example input:
- User query: "Analyze @openai posts for AI trends"
- OpenAI model selection (e.g., `gpt-4`)
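For example, the JSON input the Actor might receive could look like this (the field names match the input schema defined in Step 5):

```json
{
    "query": "Analyze @openai posts for AI trends",
    "modelName": "gpt-4"
}
```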
Example output:
- Text response with insights
- Data stored in Apify Dataset
Agents can include memory for storing information between conversations. Single-task agents typically do not require memory.
Step 3: Set up the development environment
Install the Apify CLI, which allows you to create, run, and deploy Actors from your local machine.
```bash
npm install -g apify-cli
```
Create a new Actor project from the CrewAI template and navigate into the new directory.
```bash
apify create agent-actor -t python-crewai
cd agent-actor
```
Step 4: Understand the project structure
The template includes:
- `.actor/` – Actor configuration files
  - `actor.json` – The Actor's definition
  - `input_schema.json` – Defines the UI for the Actor's input
  - `dataset_schema.json` – Defines the structure of the output data
  - `pay_per_event.json` – Configuration for monetization
- `src/` – Source code
  - `main.py` – The main script for Actor execution, agent, and task definition
  - `tools.py` – Implementations of the tools the agent can use
  - `models.py` – Pydantic models for structured tool output
  - `ppe_utils.py` – Helper functions for pay-per-event monetization
Step 5: Define input and output schemas
Update `.actor/input_schema.json` to define the Actor's inputs. This schema generates a user interface for running the Actor on the Apify platform.
```json
{
    "title": "Instagram Analysis Agent Input",
    "type": "object",
    "schemaVersion": 1,
    "properties": {
        "query": {
            "title": "Query",
            "type": "string",
            "description": "Task for the agent to perform",
            "editor": "textfield",
            "example": "Analyze @openai posts for AI trends"
        },
        "modelName": {
            "title": "Model Name",
            "type": "string",
            "description": "OpenAI model to use",
            "editor": "textfield",
            "default": "gpt-4"
        }
    },
    "required": ["query"]
}
```
Define the dataset schema in `.actor/dataset_schema.json`. This helps structure the data pushed to the dataset.
```json
{
    "title": "Instagram Analysis Output",
    "type": "object",
    "properties": {
        "query": {
            "title": "Query",
            "type": "string"
        },
        "response": {
            "title": "Response",
            "type": "string"
        }
    }
}
```
Step 6: Configure tools
The Instagram post scraper tool is implemented using the [Instagram Scraper Actor](https://apify.com/apify/instagram-scraper). The tool returns structured output as Pydantic models defined in `src/models.py`:
```python
from datetime import datetime

from pydantic import BaseModel


class InstagramPost(BaseModel):
    id: str
    url: str
    caption: str
    timestamp: datetime
    likes_count: int
    comments_count: int
```
The tool is defined in `src/tools.py` and includes:
- Tool description and argument schema for the agent
- Integration with Instagram Scraper Actor
- Data retrieval and formatting (see the sketch below)
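A minimal sketch of such a tool, assuming CrewAI's `BaseTool` interface and the synchronous `apify-client` package (with the `APIFY_TOKEN` environment variable that the platform provides to Actor runs); the class names, Actor input options, and Instagram field mappings below are illustrative rather than the template's exact code:

```python
import os

from apify_client import ApifyClient
from crewai.tools import BaseTool
from pydantic import BaseModel, Field

from src.models import InstagramPost


class InstagramScraperInput(BaseModel):
    """Arguments the agent supplies when calling the tool."""

    handle: str = Field(..., description="Instagram profile to analyze, e.g. 'openai'")
    max_posts: int = Field(10, description="Maximum number of recent posts to fetch")


class InstagramScraperTool(BaseTool):
    name: str = "instagram_post_scraper"
    description: str = "Scrape recent posts from a public Instagram profile."
    args_schema: type[BaseModel] = InstagramScraperInput

    def _run(self, handle: str, max_posts: int = 10) -> list[InstagramPost]:
        # Run the Instagram Scraper Actor and wait for it to finish.
        client = ApifyClient(os.environ["APIFY_TOKEN"])
        run = client.actor("apify/instagram-scraper").call(
            run_input={
                "directUrls": [f"https://www.instagram.com/{handle}/"],
                "resultsType": "posts",
                "resultsLimit": max_posts,
            }
        )
        # Load the scraped items from the run's default dataset and validate
        # them into the Pydantic models from src/models.py (field names of the
        # scraped items are illustrative).
        items = client.dataset(run["defaultDatasetId"]).list_items().items
        return [
            InstagramPost(
                id=item["id"],
                url=item["url"],
                caption=item.get("caption", ""),
                timestamp=item["timestamp"],
                likes_count=item.get("likesCount", 0),
                comments_count=item.get("commentsCount", 0),
            )
            for item in items
        ]


# Instance the agent uses in src/main.py.
instagram_scraper_tool = InstagramScraperTool()
```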
Step 7: Implement the agent
The agent implementation in `src/main.py` includes:

- Handle Actor input: Read the user's query and any other parameters from the Actor input.

  ```python
  from apify import Actor
  from crewai import Agent, Crew, Task
  from langchain_openai import ChatOpenAI

  from src.tools import instagram_scraper_tool

  async def main():
      async with Actor:
          actor_input = await Actor.get_input()
          query = actor_input.get("query")
          model_name = actor_input.get("modelName", "gpt-4")
  ```

- Define the agent: Instantiate the agent, giving it a role, a goal, and access to the tools you configured.

  ```python
  agent = Agent(
      role="Social Media Analyst",
      goal="Analyze Instagram posts and provide insights",
      backstory="Expert in social media analysis and trend identification",
      tools=[instagram_scraper_tool],
      llm=ChatOpenAI(model=model_name),
  )
  ```

- Create task and crew: Define the task for the agent to complete based on the user's query.

  ```python
  task = Task(
      description=query,
      agent=agent,
      expected_output="Detailed analysis with insights",
  )
  crew = Crew(
      agents=[agent],
      tasks=[task],
  )
  ```

- Execute and save results: Kick off the crew to run the task and save the final result to the Actor's default dataset.

  ```python
  result = crew.kickoff()
  await Actor.push_data({
      "query": query,
      "response": str(result),
  })
  ```
Step 8: Test locally
Run the agent on your local machine using the Apify CLI. Ensure you have set any required environment variables (e.g., `OPENAI_API_KEY`).

```bash
apify run
```
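When you run the Actor locally, the Apify CLI reads its input from the default key-value store on disk. Assuming the standard local storage layout, you can place a sample input in `storage/key_value_stores/default/INPUT.json`:

```json
{
    "query": "Analyze @openai posts for AI trends"
}
```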
Step 9: Deploy to Apify
Push your Actor's code to the Apify platform.
```bash
apify push
```
After deployment:
- Navigate to your Actor's settings.
- Set `OPENAI_API_KEY` as a secret environment variable.
- Rebuild the Actor version to apply the changes.
Step 10: Test the deployed agent
Run the agent on the platform with a sample query and monitor the results in the output dataset.
"Analyze the posts of @openai and @googledeepmind and summarize the current trends in AI."
Common issues and solutions:
- Agent fails to call tools: Check that the tool descriptions in `src/tools.py` are clear and the argument schemas are correct.
- Instagram scraper fails: Verify that the Instagram usernames exist and are public. Check the scraper Actor's run logs for specific errors.
- Missing API key: Ensure `OPENAI_API_KEY` is set as a secret environment variable in your Actor's settings.
Monetizing your AI agent
Apify's pay-per-event (PPE) pricing model lets you charge users for specific events that your Actor triggers through the Apify API or SDKs.
Step 1: Define chargeable events
You can configure charges for events like the Actor starting, a task completing successfully, or custom events such as specific API calls.
Example event definition:
```json
{
    "eventName": "task-completed",
    "description": "Charge for completed analysis task",
    "price": 0.10
}
```
Step 2: Implement charging in code
Add charging logic to your code:
```python
# Charge for one completed analysis task; the event name must match
# an event defined in your pay-per-event configuration.
await Actor.charge(event_name="task-completed", count=1)
```
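In this agent, a natural place to charge is at the end of `main()`, right after the result is stored; a sketch, assuming the `task-completed` event is defined in your pay-per-event configuration:

```python
# Inside main(), after the crew has finished (event name is illustrative).
await Actor.push_data({"query": query, "response": str(result)})
await Actor.charge(event_name="task-completed", count=1)
```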
Step 3: Configure PPE settings
- Enable pay-per-event monetization in Actor settings.
- Define events from `.actor/pay_per_event.json`.
- Set pricing for each event.
Step 4: Publish the agent
Before making your agent public on Apify Store, complete the following checklist:
- Update the README with usage instructions.
- Validate `input_schema.json` and `dataset_schema.json`.
- Verify the `OPENAI_API_KEY` environment variable is handled correctly.
- Check monetization settings on the Actor publication page.
- Test the Actor thoroughly.
- Set your Actor's visibility to public.
Next steps
To continue developing AI agents:
- Use the CrewAI template: Start with `apify create agent-actor -t python-crewai`
- Explore other templates: Visit the Apify templates page for alternatives
- Review existing agents: Check the AI Agents collection on Apify Store
- Publish and monetize: Deploy with `apify push` and enable monetization