Chapter 3: Task - Defining the Work

In Chapter 1, we met the Crew - our AI team manager. In Chapter 2, we met the Agents - our specialized AI workers. Now, we need to tell these agents exactly what to do. How do we give them specific assignments?

That’s where the Task comes in!

Why Do We Need Tasks?

Imagine our trip planning Crew again. We have a ‘Travel Researcher’ Agent and an ‘Activity Planner’ Agent. Just having them isn’t enough. We need to give them clear instructions:

Researcher: “Find some sunny cities in Europe for May.”
Planner: “Create a 3-day plan for the city the Researcher found.”

These specific instructions are Tasks in CrewAI. Instead of one vague goal, we break the project down into smaller, concrete steps.

Problem Solved: Task allows you to define individual, actionable assignments for your Agents. It turns a big goal into a manageable checklist.

What is a Task?

Think of a Task as a work order or a specific assignment given to an Agent. It clearly defines what needs to be done and what the expected result should look like.

Here are the key ingredients of a Task:

description: This is the most important part! It’s a clear and detailed explanation of what the Agent needs to accomplish. The more specific, the better.
expected_output: This tells the Agent what a successful result should look like. It sets a clear target. Examples: “A list of 3 cities with pros and cons.”, “A bulleted list of activities.”, “A paragraph summarizing the key findings.”
agent: This specifies which Agent in your Crew is responsible for completing this task. Each task is typically assigned to the agent best suited for it.
context (Optional but Important!): Tasks don’t usually happen in isolation. A task might need information or results from previous tasks. The context allows the output of one task to be automatically fed as input/background information to the next task in a sequence.
tools (Optional): You can specify a list of Tools that the Agent is allowed to use specifically for this task. This can be useful to restrict or grant specific capabilities for certain assignments.
async_execution (Optional, Advanced): You can set this to True if you want the task to potentially run at the same time as other asynchronous tasks. We’ll stick to synchronous (one after another) for now.
output_json / output_pydantic (Optional, Advanced): If you need the task’s final output in a structured format like JSON, you can specify a model here.
output_file (Optional, Advanced): You can have the task automatically save its output to a file.

A Task bundles the instructions (description, expected_output) and assigns them to the right worker (agent), potentially giving them background info (context) and specific equipment (tools).

Let’s Define a Task!

Let’s look again at the tasks we created for our trip planning Crew in Chapter 1.

# Import necessary classes
from crewai import Task, Agent # Assuming Agent class is defined as in Chapter 2

# Assume 'researcher' and 'planner' agents are already defined
# researcher = Agent(role='Travel Researcher', ...)
# planner = Agent(role='Activity Planner', ...)

# Define Task 1 for the Researcher
task1 = Task(
  description=(
      "Identify the top 3 European cities known for great sunny weather "
      "around late May. Focus on cities with vibrant culture and good food."
  ),
  expected_output=(
      "A numbered list of 3 cities, each with a brief (1-2 sentence) justification "
      "mentioning weather, culture, and food highlights."
  ),
  agent=researcher # Assign this task to our researcher agent
)

# Define Task 2 for the Planner
task2 = Task(
  description=(
      "Using the list of cities provided by the researcher, select the best city "
      "and create a detailed 3-day itinerary. Include morning, afternoon, and "
      "evening activities, plus restaurant suggestions."
  ),
  expected_output=(
      "A markdown formatted 3-day itinerary for the chosen city. "
      "Include timings, activity descriptions, and 2-3 restaurant ideas."
  ),
  agent=planner # Assign this task to our planner agent
  # context=[task1] # Optionally explicitly define context (often handled automatically)
)

# (You would then add these tasks to a Crew)
# print(task1)
# print(task2)

Explanation:

from crewai import Task: We import the Task class.
description=...: We write a clear instruction for the agent. Notice how task1 specifies the criteria (sunny, May, culture, food). task2 explicitly mentions using the output from the previous task.
expected_output=...: We define what success looks like. task1 asks for a numbered list with justifications. task2 asks for a formatted itinerary. This helps the AI agent structure its response.
agent=researcher / agent=planner: We link each task directly to the Agent responsible for doing the work.
context=[task1] (Commented Out): We could explicitly tell task2 that it depends on task1. However, when using a sequential Process in the Crew, this dependency is usually handled automatically! The output of task1 will be passed to task2 as context.

Running this code creates Task objects, ready to be managed by a Crew.

Task Workflow and Context: Connecting the Dots

Tasks are rarely standalone. They often form a sequence, where the result of one task is needed for the next. This is where context comes in.

Imagine our Crew is set up with a sequential Process (like in Chapter 1):

The Crew runs task1 using the researcher agent.
The researcher completes task1 and produces an output (e.g., “1. Lisbon…”, “2. Seville…”, “3. Malta…”). This output is stored.
The Crew moves to task2. Because it’s sequential, it automatically takes the output from task1 and provides it as context to task2.
The planner agent receives task2’s description and the list of cities from task1 as context.
The planner uses this context to complete task2 (e.g., creates an itinerary for Lisbon).

This automatic passing of information makes building workflows much easier!

graph LR
    A["Task 1: Find Cities (Agent: Researcher)"] -->|Output: Lisbon, Seville, Malta| B[Context for Task 2]
    B --> C["Task 2: Create Itinerary (Agent: Planner)"]
    C -->|Output: Lisbon Itinerary...| D[Final Result]

    style A fill:#f9f,stroke:#333,stroke-width:2px
    style C fill:#f9f,stroke:#333,stroke-width:2px
    style B fill:#ccf,stroke:#333,stroke-width:1px,stroke-dasharray: 5 5
    style D fill:#cfc,stroke:#333,stroke-width:2px

While the sequential process often handles context automatically, you can explicitly define dependencies using the context parameter in the Task definition if you need more control, especially with more complex workflows.

How Does a Task Execute “Under the Hood”?

When the Crew’s kickoff() method runs a task, here’s a simplified view of what happens:

Selection: The Crew (based on its Process) picks the next Task to execute.
Agent Assignment: It identifies the agent assigned to this Task.
Context Gathering: It collects the output from any prerequisite tasks (like the previous task in a sequential process) to form the context.
Execution Call: The Crew tells the assigned Agent to execute the Task, passing the description, expected_output, available tools (if any specified for the task), and the gathered context.
Agent Work: The Agent uses its configuration (LLM, backstory, etc.) and the provided information (task details, context, tools) to perform the work.
Result Return: The Agent generates the result and returns it as a TaskOutput object.
Output Storage: The Crew receives this TaskOutput and stores it, making it available as potential context for future tasks.

Let’s visualize the interaction:

sequenceDiagram
    participant C as Crew
    participant T1 as Task 1
    participant R_Agent as Researcher Agent
    participant T2 as Task 2
    participant P_Agent as Planner Agent

    C->>T1: Prepare to Execute
    Note right of T1: Task 1 selected
    C->>R_Agent: Execute Task(T1.description, T1.expected_output)
    R_Agent->>R_Agent: Use LLM, Profile, Tools...
    R_Agent-->>C: Return TaskOutput (Cities List)
    C->>C: Store TaskOutput from T1

    C->>T2: Prepare to Execute
    Note right of T2: Task 2 selected
    Note right of C: Get Context (Output from T1)
    C->>P_Agent: Execute Task(T2.description, T2.expected_output, context=T1_Output)
    P_Agent->>P_Agent: Use LLM, Profile, Tools, Context...
    P_Agent-->>C: Return TaskOutput (Itinerary)
    C->>C: Store TaskOutput from T2

Diving into the Code (task.py)

The Task class itself is defined in crewai/task.py. It’s primarily a container for the information you provide:

# Simplified view from crewai/task.py
from pydantic import BaseModel, Field
from typing import List, Optional, Type, Any
# Import Agent and Tool placeholders for the example
from crewai import BaseAgent, BaseTool

class TaskOutput(BaseModel): # Simplified representation of the result
    description: str
    raw: str
    agent: str
    # ... other fields like pydantic, json_dict

class Task(BaseModel):
    # Core attributes
    description: str = Field(description="Description of the actual task.")
    expected_output: str = Field(description="Clear definition of expected output.")
    agent: Optional[BaseAgent] = Field(default=None, description="Agent responsible.")

    # Optional attributes
    context: Optional[List["Task"]] = Field(default=None, description="Context from other tasks.")
    tools: Optional[List[BaseTool]] = Field(default_factory=list, description="Task-specific tools.")
    async_execution: Optional[bool] = Field(default=False)
    output_json: Optional[Type[BaseModel]] = Field(default=None)
    output_pydantic: Optional[Type[BaseModel]] = Field(default=None)
    output_file: Optional[str] = Field(default=None)
    callback: Optional[Any] = Field(default=None) # Function to call after execution

    # Internal state
    output: Optional[TaskOutput] = Field(default=None, description="Task output after execution")

    def execute_sync(
        self,
        agent: Optional[BaseAgent] = None,
        context: Optional[str] = None,
        tools: Optional[List[BaseTool]] = None,
    ) -> TaskOutput:
        # 1. Identify the agent to use (passed or self.agent)
        agent_to_execute = agent or self.agent
        if not agent_to_execute:
            raise Exception("No agent assigned to task.")

        # 2. Prepare tools (task tools override agent tools if provided)
        execution_tools = tools or self.tools or agent_to_execute.tools

        # 3. Call the agent's execute_task method
        #    (The agent handles LLM calls, tool use, etc.)
        raw_result = agent_to_execute.execute_task(
            task=self, # Pass self (the task object)
            context=context,
            tools=execution_tools,
        )

        # 4. Format the output
        # (Handles JSON/Pydantic conversion if requested)
        pydantic_output, json_output = self._export_output(raw_result)

        # 5. Create and return TaskOutput object
        task_output = TaskOutput(
            description=self.description,
            raw=raw_result,
            pydantic=pydantic_output,
            json_dict=json_output,
            agent=agent_to_execute.role,
            # ... other fields
        )
        self.output = task_output # Store the output within the task object

        # 6. Execute callback if defined
        if self.callback:
            self.callback(task_output)

        # 7. Save to file if output_file is set
        if self.output_file:
            # ... logic to save file ...
            pass

        return task_output

    def prompt(self) -> str:
        # Combines description and expected output for the agent
        return f"{self.description}\n\nExpected Output:\n{self.expected_output}"

    # ... other methods like execute_async, _export_output, _save_file ...

Key takeaways from the code:

The Task class holds the configuration (description, expected_output, agent, etc.).
The execute_sync (and execute_async) method orchestrates the execution by calling the assigned agent’s execute_task method. The task itself doesn’t contain the AI logic; it delegates that to the agent.
It takes the raw result from the agent and wraps it in a TaskOutput object, handling formatting (like JSON) and optional actions (callbacks, file saving).
The prompt() method shows how the core instructions are formatted before being potentially combined with context and tool descriptions by the agent.

Advanced Task Features (A Quick Peek)

While we focused on the basics, Task has more capabilities:

Asynchronous Execution (async_execution=True): Allows multiple tasks to run concurrently, potentially speeding up your Crew if tasks don’t strictly depend on each other’s immediate output.
Structured Outputs (output_json, output_pydantic): Force the agent to return data in a specific Pydantic model or JSON structure, making it easier to use the output programmatically.
File Output (output_file='path/to/output.txt'): Automatically save the task’s result to a specified file.
Conditional Tasks (ConditionalTask): A special type of task (defined in crewai.tasks.conditional_task) that only runs if a specific condition (based on the previous task’s output) is met. This allows for branching logic in your workflows.

Conclusion

You’ve now learned about the Task – the fundamental unit of work in CrewAI. A Task defines what needs to be done (description), what the result should look like (expected_output), and who should do it (agent). Tasks are the building blocks of your Crew’s plan, and their outputs often flow as context to subsequent tasks, creating powerful workflows.

We’ve seen how to define Agents and give them Tasks. But what if an agent needs a specific ability, like searching the internet, calculating something, or reading a specific document? How do we give our agents superpowers? That’s where Tools come in! Let’s explore them in the next chapter.

Next: Chapter 4: Tool - Equipping Your Agents

Generated by AI Codebase Knowledge Builder