BuildFast Bot
Ask to

BuildFast Bot

BuildFast Bot

Hey! Wanna know about Generative AI Crash Course?

BuildFastwithAI
satvik@buildfastwithai.com

Koramangala, Bengaluru, 560034

Support

  • Consulting
  • GenAI Course
  • BuildFast Studio

Company

  • Resources
  • Events

Legal

  • Privacy
  • Terms
  • Refund

Our Products

Educhain

Educhain

AI-powered education platform for teachers

BuildFast Studio

BuildFast Studio

The Indian version of CharacterAI but even more varieties.

LinkedInInstagramTwitterGitHub

© 2025 Intellify Edventures Private Limited All rights reserved.

Data Analysis with PandasAI: An Intelligent Way to Explore Data

December 11, 2024
3 min read
Published
Data Analysis with PandasAI: An Intelligent Way to Explore Data
Data Analysis with PandasAI: An Intelligent Way to Explore Data - BuildFast with AI

Introduction to Data Analysis with PandasAI

PandasAI is an innovative Python library that enhances traditional data analysis with the power of natural language processing (NLP). It allows you to query pandas DataFrames using natural language queries, making data exploration more intuitive and accessible.

In this blog, you'll learn how to:

  1. Set up PandasAI in your environment.
  2. Create a Smart DataFrame.
  3. Perform basic and advanced data analysis using natural language.
  4. Visualize data insights with intelligent queries.

Let's dive in!

1. Installation and Setup

First, let's install the PandasAI library. You can do this using pip:

!pip install -qU pandasai pandas

We also need to install the pandas library if it's not already installed:

pip install pandas

2. Creating a Smart DataFrame

To use PandasAI, you'll need a Smart DataFrame (a pandas DataFrame enhanced with AI capabilities). Here's a step-by-step guide to creating one.

Import Libraries

import pandas as pd
from pandasai import PandasAI
from pandasai.llm.openai import OpenAI

Initialize the PandasAI Engine

You'll need an OpenAI API key to use the language model. Initialize PandasAI with your key as follows:

llm = OpenAI(api_token="YOUR_OPENAI_API_KEY")
pandas_ai = PandasAI(llm)

Sample Data

Let's create a simple DataFrame with sales data:

data = {
    "Product": ["Laptop", "Mouse", "Keyboard", "Monitor", "Headphones"],
    "Sales": [120, 340, 150, 80, 300],
    "Revenue": [120000, 3400, 15000, 8000, 30000]
}
df = pd.DataFrame(data)
print(df)

Output:

     Product  Sales  Revenue
0     Laptop    120   120000
1      Mouse    340     3400
2   Keyboard    150    15000
3    Monitor     80     8000
4  Headphones   300    30000

3. Basic Data Analysis with Natural Language

Now that we have a Smart DataFrame, let's perform some basic analysis using natural language queries.

Example Query 1: "What is the total revenue?"

pandas_ai.run(df, prompt="What is the total revenue?")

Output:

The total revenue is $180,400.

Example Query 2: "Which product has the highest sales?"

pandas_ai.run(df, prompt="Which product has the highest sales?")

Output:

The product with the highest sales is Mouse with 340 units sold.

4. Advanced Queries and Visualizations

PandasAI can also generate visualizations based on your queries.

Example Query: "Show a bar chart of sales by product."

pandas_ai.run(df, prompt="Show a bar chart of sales by product.")

Output:

A bar chart will be generated showing sales figures for each product.

Example Query: "What is the average revenue?"

pandas_ai.run(df, prompt="What is the average revenue?")

Output:

The average revenue is $36,080.

Conclusion

PandasAI simplifies data analysis by allowing you to interact with your datasets using natural language. This is especially useful for those who may not be familiar with Python or pandas syntax but still need to extract insights from data.

Key Takeaways

  • Ease of Use: Natural language queries make data analysis accessible.
  • Integration: Works seamlessly with pandas DataFrames.
  • Visualization: Automatically generates charts based on queries.

Next Steps

  • Experiment with your own datasets.
  • Combine PandasAI with other libraries like matplotlib and seaborn for enhanced visualizations.
  • Explore more complex queries and custom prompts.

Resources

  • PandasAI GitHub Repository
  • OpenAI API Documentation
  • Build Fast with AI Pandas GitHub Repo

--------------------------------------------------------------------------

Stay Updated:- Follow Build Fast with AI pages for all the latest AI updates and resources.

Experts predict 2025 will be the defining year for Gen AI implementation.Want to be ahead of the curve?

Join Build Fast with AI’s Gen AI Launch Pad 2025 - your accelerated path to mastering AI tools and building revolutionary applications.

👉 Limited Spots, join the waitlist now: www.buildfastwithai.com/genai-course 

buildfastwithai
GenAI Bootcamp
Daily GenAI Quiz
BuildFast Studio
Resources
buildfastwithai