On this page

How to Automate Customer Sentiment Analysis Reports: A Guide for Developers

May 20, 2024 86 min

How to Automate Customer Sentiment Analysis Reports: A Guide for Developers

If you’re a developer, especially for an automated platform company, this guide is for you. It walks you through a script we created to generate customer sentiment analysis reports automatically. Creating these reports manually takes a lot of time and energy — something few developers have. However, you can automate the report generation process with AI and some scripting. You can find links to download the script file and related materials at the bottom of this guide.

What you’ll need to run the script

Customer Reviews Data — You should obtain customer reviews data from a reliable source. For this guide, we’re getting the data from the Webz.io eCommerce Reviews API. It provides product information and customer reviews from 900+ eCommerce and marketplace sources.
- You need an API key to use the Webz.io eCommerce Reviews API, and you can get one by contacting Webz.io. This guide includes a free NDJSON file with sample reviews data if you would like to experiment with the script without using the API.
OpenAI API — You’ll use OpenAI’s API to leverage the GPT-4 and DALL·E models. GPT-4 analyzes and summarizes the text from customer reviews, while DALL·E generates a main image for the report.
- You also need an API key for the OpenAI API. Create an account or sign in at OpenAI to get a key. OpenAI uses pay-per-use pricing for its language and image models. You can see the price points on the OpenAI website.
Python — We’re using Python to automate the report creation process. You’ll need to ensure you can run Python code on your machine.

alt="How Webz.io and ChatGPT automate your report"

Automating customer sentiment analysis reports: script breakdown

The script fetches product reviews from an external NDJSON file generated by the Webz.io eCommerce Reviews API. Next, the script calls the OpenAI API, using its text and image models to analyze the reviews for positive and negative sentiments. It then compiles these findings into a structured report, outputting them into a Word document. The document contains the completed customer sentiment analysis report.

Get Started with News API – For Free!

Here is the detailed breakdown of the script:

Import files, packages, and modules

First, the script imports the NDJSON file, Python packages and modules, the OpenAI Python API library, and other necessary files.

import json<br /><br /><br /><br /><br /><br />
import glob<br /><br /><br /><br /><br /><br />
import docx<br /><br /><br /><br /><br /><br />
import requests<br /><br /><br /><br /><br /><br />
from openai import OpenAI<br /><br /><br /><br /><br /><br />
import os<br /><br /><br /><br /><br /><br />
import openai<br /><br /><br /><br /><br /><br />
from docx.shared import Pt<br /><br /><br /><br /><br /><br />
from bs4 import BeautifulSoup<br /><br /><br /><br /><br /><br />
import io<br /><br /><br /><br /><br /><br />
from docx.shared import Pt<br /><br /><br /><br /><br /><br />
from docx.enum.text import WD_ALIGN_PARAGRAPH<br /><br /><br /><br /><br /><br />
from docx.oxml.shared import OxmlElement, qn<br /><br /><br /><br /><br /><br />
from docx.opc.constants import RELATIONSHIP_TYPE

import json

import glob

import docx

import requests

from openai import OpenAI

import os

import openai

from docx.shared import Pt

from bs4 import BeautifulSoup

import io

from docx.shared import Pt

from docx.enum.text import WD_ALIGN_PARAGRAPH

from docx.oxml.shared import OxmlElement, qn

from docx.opc.constants import RELATIONSHIP_TYPE

Set global variable and access API key

Next, the script accesses the Open AI API key through the development environment. It also includes a global variable where you can set the number of reviews included in the report.

openai.api_key = os.getenv(“OPENAI_API_KEY”)<br /><br /><br /><br /><br /><br />
NUM_OF_REVIEWS = 50<br /><br /><br /><br /><br /><br />
client = OpenAI()

openai.api_key = os.getenv(“OPENAI_API_KEY”)

NUM_OF_REVIEWS = 50

client = OpenAI()

Orchestrate entire process (main)

Towards the end of the script, you’ll see the “main” function. It orchestrates the entire process — from reading the reviews and generating a report main image to generating the report text and creating the final Word document.

def main():<br /><br /><br /><br /><br /><br />
    # Path to the reviews folder<br /><br /><br /><br /><br /><br />
    reviews_folder = ‘reviews'</p><br /><br /><br /><br /><br />
<p>    # Read product information<br /><br /><br /><br /><br /><br />
    product_info = read_ndjson_file(os.path.join(reviews_folder, ‘Products.ndjson’))[0]</p><br /><br /><br /><br /><br />
<p>    # Initialize an empty list to store all reviews<br /><br /><br /><br /><br /><br />
    positive_reviews = []<br /><br /><br /><br /><br /><br />
    negative_reviews = []</p><br /><br /><br /><br /><br />
<p>    # Counters for each star rating<br /><br /><br /><br /><br /><br />
    one_star_count = 0<br /><br /><br /><br /><br /><br />
    two_star_count = 0<br /><br /><br /><br /><br /><br />
    three_star_count = 0<br /><br /><br /><br /><br /><br />
    four_star_count = 0<br /><br /><br /><br /><br /><br />
    five_star_count = 0</p><br /><br /><br /><br /><br />
<p>    # Read all reviews files in the reviews folder and add those with substantial text into separate lists<br /><br /><br /><br /><br /><br />
    for review_file in glob.glob(os.path.join(reviews_folder, ‘Reviews_*.ndjson’)):<br /><br /><br /><br /><br /><br />
        reviews = read_ndjson_file(review_file)<br /><br /><br /><br /><br /><br />
        for review in reviews:</p><br /><br /><br /><br /><br />
<p>            if review[‘rating’] == 1:<br /><br /><br /><br /><br /><br />
                one_star_count += 1<br /><br /><br /><br /><br /><br />
            elif review[‘rating’] == 2:<br /><br /><br /><br /><br /><br />
                two_star_count += 1<br /><br /><br /><br /><br /><br />
            elif review[‘rating’] == 3:<br /><br /><br /><br /><br /><br />
                three_star_count += 1<br /><br /><br /><br /><br /><br />
            elif review[‘rating’] == 4:<br /><br /><br /><br /><br /><br />
                four_star_count += 1<br /><br /><br /><br /><br /><br />
            elif review[‘rating’] == 5:<br /><br /><br /><br /><br /><br />
                five_star_count += 1</p><br /><br /><br /><br /><br />
<p>            if len(review[‘text’]) > 100:<br /><br /><br /><br /><br /><br />
                if review[‘rating’] <3: #  1-2 stars rating is negative<br /><br /><br /><br /><br /><br />
                    negative_reviews.append(review)<br /><br /><br /><br /><br /><br />
                if review[‘rating’] >3: # 4-5 starts reating is positive<br /><br /><br /><br /><br /><br />
                    positive_reviews.append(review)</p><br /><br /><br /><br /><br />
<p>    image_url = generate_article_image(product_info[‘name’])<br /><br /><br /><br /><br /><br />
    title = generate_title(product_info[‘name’])<br /><br /><br /><br /><br /><br />
    intro = generate_intro(product_info[‘name’], product_info[‘description’])</p><br /><br /><br /><br /><br />
<p>    positive_bullet_points = “\n”.join(extract_points(positive_reviews, ‘positive’))<br /><br /><br /><br /><br /><br />
    negative_bullet_points = “\n”.join(extract_points(negative_reviews, ‘negative’))</p><br /><br /><br /><br /><br />
<p>    positive_report = create_positive_report(positive_bullet_points, product_info[‘name’])<br /><br /><br /><br /><br /><br />
    negative_report = create_negative_report(negative_bullet_points, product_info[‘name’])</p><br /><br /><br /><br /><br />
<p>    create_word_doc(“customer sentiment analysis report.docx”, title, image_url, product_info , intro, negative_report, positive_report,<br /><br /><br /><br /><br /><br />
                    one_star_count, two_star_count, three_star_count, four_star_count, five_star_count)</p><br /><br /><br /><br /><br />
<p>    print(“done”)</p><br /><br /><br /><br /><br />
<p>if __name__ == “__main__”:<br /><br /><br /><br /><br /><br />
    main()

def main():

# Path to the reviews folder

reviews_folder = ‘reviews’

# Read product information

product_info = read_ndjson_file(os.path.join(reviews_folder, ‘Products.ndjson’))[0]

# Initialize an empty list to store all reviews

positive_reviews = []

negative_reviews = []

# Counters for each star rating

one_star_count = 0

two_star_count = 0

three_star_count = 0

four_star_count = 0

five_star_count = 0

# Read all reviews files in the reviews folder and add those with substantial text into separate lists

for review_file in glob.glob(os.path.join(reviews_folder, ‘Reviews_*.ndjson’)):

reviews = read_ndjson_file(review_file)

for review in reviews:

if review[‘rating’] == 1:

one_star_count += 1

elif review[‘rating’] == 2:

two_star_count += 1

elif review[‘rating’] == 3:

three_star_count += 1

elif review[‘rating’] == 4:

four_star_count += 1

elif review[‘rating’] == 5:

five_star_count += 1

if len(review[‘text’]) > 100:

if review[‘rating’] <3: # 1-2 stars rating is negative

negative_reviews.append(review)

if review[‘rating’] >3: # 4-5 starts reating is positive

positive_reviews.append(review)

image_url = generate_article_image(product_info[‘name’])

title = generate_title(product_info[‘name’])

intro = generate_intro(product_info[‘name’], product_info[‘description’])

positive_bullet_points = “\n”.join(extract_points(positive_reviews, ‘positive’))

negative_bullet_points = “\n”.join(extract_points(negative_reviews, ‘negative’))

positive_report = create_positive_report(positive_bullet_points, product_info[‘name’])

negative_report = create_negative_report(negative_bullet_points, product_info[‘name’])

create_word_doc(“customer sentiment analysis report.docx”, title, image_url, product_info , intro, negative_report, positive_report,

one_star_count, two_star_count, three_star_count, four_star_count, five_star_count)

print(“done”)

if __name__ == “__main__”:

main()

Define functions

Now we define the different functions of our script:

Read NDJSON file (read_ndjson_file)

Reads a NDJSON file and returns its content.

def read_ndjson_file(file_path):<br /><br /><br /><br /><br /><br />
    “””Reads an ndjson file and returns the content as a list of dictionaries.”””<br /><br /><br /><br /><br /><br />
    with open(file_path, ‘r’, encoding=’utf-8′) as file:<br /><br /><br /><br /><br /><br />
        return [json.loads(line) for line in file]

def read_ndjson_file(file_path):

“””Reads an ndjson file and returns the content as a list of dictionaries.”””

with open(file_path, ‘r’, encoding=‘utf-8’) as file:

return [json.loads(line) for line in file]

Send prompt (call_gpt_completion)

Sends a prompt to the GPT-4 model and receives a response.

def call_gpt_completion(prompt):<br /><br /><br /><br /><br /><br />
    return client.chat.completions.create(<br /><br /><br /><br /><br /><br />
        model=”gpt-4-1106-preview”,<br /><br /><br /><br /><br /><br />
        max_tokens=4096,<br /><br /><br /><br /><br /><br />
        messages=[<br /><br /><br /><br /><br /><br />
            {“role”: “user”, “content”: prompt},<br /><br /><br /><br /><br /><br />
        ]<br /><br /><br /><br /><br /><br />
    )

def call_gpt_completion(prompt):

return client.chat.completions.create(

model=“gpt-4-1106-preview”,

max_tokens=4096,

messages=[

{“role”: “user”, “content”: prompt},

]

)

Extract points (extract_points)

Extracts key points from reviews based on sentiment (positive/negative).

def extract_points(reviews, sentiment):<br /><br /><br /><br /><br /><br />
    print(“Extract Points: ” + sentiment)<br /><br /><br /><br /><br /><br />
    points = []<br /><br /><br /><br /><br /><br />
    for review in reviews:<br /><br /><br /><br /><br /><br />
        review_text = review[‘title’] + “\n” + review[‘text’]<br /><br /><br /><br /><br /><br />
        prompt = f”The following is a {sentiment} review of a product, summarize in one bullet point the main {sentiment} feedback:\n{review_text}”<br /><br /><br /><br /><br /><br />
        summary = “”<br /><br /><br /><br /><br /><br />
        try:<br /><br /><br /><br /><br /><br />
            response = call_gpt_completion(prompt)<br /><br /><br /><br /><br /><br />
            for choice in response.choices:<br /><br /><br /><br /><br /><br />
                summary += choice.message.content<br /><br /><br /><br /><br /><br />
        except Exception as e:<br /><br /><br /><br /><br /><br />
            print(“An error occurred:”, str(e))<br /><br /><br /><br /><br /><br />
        points.append(summary)<br /><br /><br /><br /><br /><br />
        if len(points) == NUM_OF_REVIEWS:<br /><br /><br /><br /><br /><br />
            break<br /><br /><br /><br /><br /><br />
    return points

def extract_points(reviews, sentiment):

print(“Extract Points: “ + sentiment)

points = []

for review in reviews:

review_text = review[‘title’] + “\n” + review[‘text’]

prompt = f“The following is a {sentiment} review of a product, summarize in one bullet point the main {sentiment} feedback:\n{review_text}”

summary = “”

try:

response = call_gpt_completion(prompt)

for choice in response.choices:

summary += choice.message.content

except Exception as e:

print(“An error occurred:”, str(e))

points.append(summary)

if len(points) == NUM_OF_REVIEWS:

break

return points

Generate title (generate_title)

Creates a title for the sentiment analysis report.

def generate_title(product_name):<br /><br /><br /><br /><br /><br />
    print(“Creating a title”)</p><br /><br /><br /><br /><br />
<p>    prompt = “Create a title for a customer sentiment report about the following product:\n” + product_name<br /><br /><br /><br /><br /><br />
    title_text = “”<br /><br /><br /><br /><br /><br />
    try:<br /><br /><br /><br /><br /><br />
        response = call_gpt_completion(prompt)</p><br /><br /><br /><br /><br />
<p>        for choice in response.choices:<br /><br /><br /><br /><br /><br />
            title_text += choice.message.content<br /><br /><br /><br /><br /><br />
    except Exception as e:<br /><br /><br /><br /><br /><br />
        print(“An error occurred:”, str(e))</p><br /><br /><br /><br /><br />
<p>    title_text = title_text.strip(” “).strip(‘\”‘)<br /><br /><br /><br /><br /><br />
    if title_text.startswith(“Title:”):  # Sometimes ChatGPT prefix the title with Title:<br /><br /><br /><br /><br /><br />
        return title_text[len(“Title:”):]</p><br /><br /><br /><br /><br />
<p>    return title_text

def generate_title(product_name):

print(“Creating a title”)

prompt = “Create a title for a customer sentiment report about the following product:\n” + product_name

title_text = “”

try:

response = call_gpt_completion(prompt)

for choice in response.choices:

title_text += choice.message.content

except Exception as e:

print(“An error occurred:”, str(e))

title_text = title_text.strip(” “).strip(‘\”‘)

if title_text.startswith(“Title:”): # Sometimes ChatGPT prefix the title with Title:

return title_text[len(“Title:”):]

return title_text

Generate introduction (generate_intro)

Generates an introductory paragraph for the report.

def generate_intro(product_name, product_description):<br /><br /><br /><br /><br /><br />
    print(“Generate post intro”)</p><br /><br /><br /><br /><br />
<p>    prompt = f”””<br /><br /><br /><br /><br /><br />
        Write a paragraph introducing a customer sentiment report about:<br /><br /><br /><br /><br /><br />
        Product name: {product_name}<br /><br /><br /><br /><br /><br />
        Product description: {product_description}</p><br /><br /><br /><br /><br />
<p>        The report is created automatically by using Webz.io eCommerce api and ChatGPT. The report is generated by calling the Webz.io eCommerce API for the reviews about {product_name}. It then splits the product reviews into positive and negative reviews. Following this step, it summarizes up to {NUM_OF_REVIEWS} reviews from both negative and positive reviews using ChatGPT to create a comprehensive list of posts. It then gives those lists to ChatGPT to create a comprehensive report highlighting both positive and negative feedback and provide a report based on the feedback.<br /><br /><br /><br /><br /><br />
        “””</p><br /><br /><br /><br /><br />
<p>    intro = “”<br /><br /><br /><br /><br /><br />
    try:<br /><br /><br /><br /><br /><br />
        response = call_gpt_completion(prompt)</p><br /><br /><br /><br /><br />
<p>        for choice in response.choices:<br /><br /><br /><br /><br /><br />
            intro += choice.message.content<br /><br /><br /><br /><br /><br />
    except Exception as e:<br /><br /><br /><br /><br /><br />
        print(“An error occurred:”, str(e))</p><br /><br /><br /><br /><br />
<p>    return intro

def generate_intro(product_name, product_description):

print(“Generate post intro”)

prompt = f“””

Write a paragraph introducing a customer sentiment report about:

Product name: {product_name}

Product description: {product_description}

The report is created automatically by using Webz.io eCommerce api and ChatGPT. The report is generated by calling the Webz.io eCommerce API for the reviews about {product_name}. It then splits the product reviews into positive and negative reviews. Following this step, it summarizes up to {NUM_OF_REVIEWS} reviews from both negative and positive reviews using ChatGPT to create a comprehensive list of posts. It then gives those lists to ChatGPT to create a comprehensive report highlighting both positive and negative feedback and provide a report based on the feedback.

“””

intro = “”

try:

response = call_gpt_completion(prompt)

for choice in response.choices:

intro += choice.message.content

except Exception as e:

print(“An error occurred:”, str(e))

return intro

Generate negative report (create_negative_report)

Creates a report based on negative feedback.

def create_negative_report(feedback, product_name):<br /><br /><br /><br /><br /><br />
    print(“Generating Negative Report”)</p><br /><br /><br /><br /><br />
<p>    prompt = f”””Create a customer sentiment analysis report that includes the following sections. Use  <UL> and <LI> tags for listing items and <strong> for the titles of each section.</p><br /><br /><br /><br /><br />
<p>            <HTML><br /><br /><br /><br /><br /><br />
            <strong>Analysis of Feedback</strong><br /><br /><br /><br /><br /><br />
            <UL><LI>Summarize recurring and common negative issues mentioned in the reviews.</LI></UL><br /><br /><br /><br /><br /><br />
            <strong>Recommendations</strong><br /><br /><br /><br /><br /><br />
            <UL><LI>Based on the analysis, suggest actionable measures the company can take to address the issues raised in the feedback.</LI></UL><br /><br /><br /><br /><br /><br />
            <strong>Conclusion</strong><br /><br /><br /><br /><br /><br />
            <UL><LI>Summarize the key findings of the report.</LI></UL><br /><br /><br /><br /><br /><br />
            </HTML></p><br /><br /><br /><br /><br />
<p>                The following is the list of the negative feedback about the product you will base your report on:<br /><br /><br /><br /><br /><br />
                {feedback}</p><br /><br /><br /><br /><br />
<p>                    “””</p><br /><br /><br /><br /><br />
<p>    try:<br /><br /><br /><br /><br /><br />
        response = call_gpt_completion(prompt)</p><br /><br /><br /><br /><br />
<p>        report = “”</p><br /><br /><br /><br /><br />
<p>        for choice in response.choices:<br /><br /><br /><br /><br /><br />
            report += choice.message.content</p><br /><br /><br /><br /><br />
<p>    except Exception as e:<br /><br /><br /><br /><br /><br />
        print(“An error occurred:”, str(e))</p><br /><br /><br /><br /><br />
<p>    return report

def create_negative_report(feedback, product_name):

print(“Generating Negative Report”)

prompt = f“””Create a customer sentiment analysis report that includes the following sections. Use <UL> and <LI> tags for listing items and for the titles of each section.

<HTML>

Analysis of Feedback

<UL><LI>Summarize recurring and common negative issues mentioned in the reviews.</LI></UL>

Recommendations

<UL><LI>Based on the analysis, suggest actionable measures the company can take to address the issues raised in the feedback.</LI></UL>

Conclusion

<UL><LI>Summarize the key findings of the report.</LI></UL>

</HTML>

The following is the list of the negative feedback about the product you will base your report on:

{feedback}

“””

try:

response = call_gpt_completion(prompt)

report = “”

for choice in response.choices:

report += choice.message.content

except Exception as e:

print(“An error occurred:”, str(e))

return report

Generate positive report (create_positive_report)

Creates a report based on positive feedback.

def create_positive_report(feedback, product_name):<br /><br /><br /><br /><br /><br />
    print(“Generating Positive Report”)</p><br /><br /><br /><br /><br />
<p>    prompt = f”””Create a customer sentiment analysis report that includes the following sections. Use  <UL> and <LI> tags for listing items and <strong> for the titles of each section. </p><br /><br /><br /><br /><br />
<p>                <HTML><br /><br /><br /><br /><br /><br />
                <strong>Detailed Analysis</strong><br /><br /><br /><br /><br /><br />
                <UL><LI>Summarize common positive feedback mentioned in the reviews.</LI></UL><br /><br /><br /><br /><br /><br />
                <strong>Recommendations </strong><br /><br /><br /><br /><br /><br />
                <UL><LI>Propose marketing strategies that leverage the positive aspects highlighted in the reviews. </LI></UL><br /><br /><br /><br /><br /><br />
                <strong>Conclusion </strong><br /><br /><br /><br /><br /><br />
                <UL><LI>Summarize the key findings and the overall sentiment of the customers towards the product.</LI></UL><br /><br /><br /><br /><br /><br />
                </HTML>                     </p><br /><br /><br /><br /><br />
<p>                The following is the list of the positive feedback about the product you will base your report on:</p><br /><br /><br /><br /><br />
<p>                {feedback}</p><br /><br /><br /><br /><br />
<p>                “””</p><br /><br /><br /><br /><br />
<p>    try:<br /><br /><br /><br /><br /><br />
        response = call_gpt_completion(prompt)</p><br /><br /><br /><br /><br />
<p>        report = “”</p><br /><br /><br /><br /><br />
<p>        for choice in response.choices:<br /><br /><br /><br /><br /><br />
            report += choice.message.content</p><br /><br /><br /><br /><br />
<p>    except Exception as e:<br /><br /><br /><br /><br /><br />
        print(“An error occurred:”, str(e))</p><br /><br /><br /><br /><br />
<p>    return report

def create_positive_report(feedback, product_name):

print(“Generating Positive Report”)

prompt = f“””Create a customer sentiment analysis report that includes the following sections. Use <UL> and <LI> tags for listing items and for the titles of each section.

<HTML>

Detailed Analysis

<UL><LI>Summarize common positive feedback mentioned in the reviews.</LI></UL>

Recommendations

<UL><LI>Propose marketing strategies that leverage the positive aspects highlighted in the reviews. </LI></UL>

Conclusion

<UL><LI>Summarize the key findings and the overall sentiment of the customers towards the product.</LI></UL>

</HTML>

The following is the list of the positive feedback about the product you will base your report on:

{feedback}

“””

try:

response = call_gpt_completion(prompt)

report = “”

for choice in response.choices:

report += choice.message.content

except Exception as e:

print(“An error occurred:”, str(e))

return report

HTML to formatted text (html_to_word)

Converts HTML content to formatted text in a Word document. It handles bold text and bullet lists.

def html_to_word(doc, html_content):<br /><br /><br /><br /><br /><br />
    soup = BeautifulSoup(html_content, ‘html.parser’)</p><br /><br /><br /><br /><br />
<p>    for element in soup.find_all([‘strong’, ‘ul’]):<br /><br /><br /><br /><br /><br />
        if element.name == ‘strong’:<br /><br /><br /><br /><br /><br />
            # Add bold text as a heading<br /><br /><br /><br /><br /><br />
            doc.add_paragraph(element.get_text().strip(), style=’Heading 2′)<br /><br /><br /><br /><br /><br />
        elif element.name == ‘ul’:<br /><br /><br /><br /><br /><br />
            for item in element.find_all(‘li’):<br /><br /><br /><br /><br /><br />
                # Add list items<br /><br /><br /><br /><br /><br />
                doc.add_paragraph(item.get_text().strip(), style=’List Bullet’)

def html_to_word(doc, html_content):

soup = BeautifulSoup(html_content, ‘html.parser’)

for element in soup.find_all([‘strong’, ‘ul’]):

if element.name == ‘strong’:

# Add bold text as a heading

doc.add_paragraph(element.get_text().strip(), style=‘Heading 2’)

elif element.name == ‘ul’:

for item in element.find_all(‘li’):

# Add list items

doc.add_paragraph(item.get_text().strip(), style=‘List Bullet’)

Add hyperlink (add_hyperlink)

Inserts a hyperlink into a Word document paragraph.

def add_hyperlink(paragraph, url, text):<br /><br /><br /><br /><br /><br />
    “””<br /><br /><br /><br /><br /><br />
    A function that places a hyperlink within a paragraph object.</p><br /><br /><br /><br /><br />
<p>    :param paragraph: The paragraph we are adding the hyperlink to.<br /><br /><br /><br /><br /><br />
    :param url: The URL the link points to.<br /><br /><br /><br /><br /><br />
    :param text: The text displayed for the link.<br /><br /><br /><br /><br /><br />
    “””<br /><br /><br /><br /><br /><br />
    part = paragraph.part<br /><br /><br /><br /><br /><br />
    r_id = part.relate_to(url, RELATIONSHIP_TYPE.HYPERLINK, is_external=True)</p><br /><br /><br /><br /><br />
<p>    hyperlink = OxmlElement(‘w:hyperlink’)<br /><br /><br /><br /><br /><br />
    hyperlink.set(qn(‘r:id’), r_id)</p><br /><br /><br /><br /><br />
<p>    new_run = OxmlElement(‘w:r’)<br /><br /><br /><br /><br /><br />
    rPr = OxmlElement(‘w:rPr’)</p><br /><br /><br /><br /><br />
<p>    # Set the style for the hyperlink (color, underline)<br /><br /><br /><br /><br /><br />
    c = OxmlElement(‘w:color’)<br /><br /><br /><br /><br /><br />
    c.set(qn(‘w:val’), ‘0000FF’)  # Blue color<br /><br /><br /><br /><br /><br />
    rPr.append(c)<br /><br /><br /><br /><br /><br />
    u = OxmlElement(‘w:u’)<br /><br /><br /><br /><br /><br />
    u.set(qn(‘w:val’), ‘single’)<br /><br /><br /><br /><br /><br />
    rPr.append(u)</p><br /><br /><br /><br /><br />
<p>    new_run.append(rPr)<br /><br /><br /><br /><br /><br />
    new_run.text = text<br /><br /><br /><br /><br /><br />
    hyperlink.append(new_run)<br /><br /><br /><br /><br /><br />
    paragraph._p.append(hyperlink)

def add_hyperlink(paragraph, url, text):

“””

A function that places a hyperlink within a paragraph object.

:param paragraph: The paragraph we are adding the hyperlink to.

:param url: The URL the link points to.

:param text: The text displayed for the link.

“””

part = paragraph.part

r_id = part.relate_to(url, RELATIONSHIP_TYPE.HYPERLINK, is_external=True)

hyperlink = OxmlElement(‘w:hyperlink’)

hyperlink.set(qn(‘r:id’), r_id)

new_run = OxmlElement(‘w:r’)

rPr = OxmlElement(‘w:rPr’)

# Set the style for the hyperlink (color, underline)

c = OxmlElement(‘w:color’)

c.set(qn(‘w:val’), ‘0000FF’) # Blue color

rPr.append(c)

u = OxmlElement(‘w:u’)

u.set(qn(‘w:val’), ‘single’)

rPr.append(u)

new_run.append(rPr)

new_run.text = text

hyperlink.append(new_run)

paragraph._p.append(hyperlink)

Add title placeholder (insert_titles_in_text)

Adds a placeholder for inserting the titles.

def insert_titles_in_text(text, reports):<br /><br /><br /><br /><br /><br />
    # Placeholder for inserting the titles<br /><br /><br /><br /><br /><br />
    placeholder = “[]”</p><br /><br /><br /><br /><br />
<p>    # Extracting the titles from the reports and formatting them with new lines<br /><br /><br /><br /><br /><br />
    titles = “\n”.join([report[‘title’] for report in reports])</p><br /><br /><br /><br /><br />
<p>    # Replacing the placeholder with the titles<br /><br /><br /><br /><br /><br />
    updated_text = text.replace(placeholder, titles)</p><br /><br /><br /><br /><br />
<p>    return updated_text

def insert_titles_in_text(text, reports):

# Placeholder for inserting the titles

placeholder = “[]”

# Extracting the titles from the reports and formatting them with new lines

titles = “\n”.join([report[‘title’] for report in reports])

# Replacing the placeholder with the titles

updated_text = text.replace(placeholder, titles)

return updated_text

Generate image (generate_article_image)

Generates a main image for the report using OpenAI’s DALL-E model with a specific prompt.

def generate_article_image(name):<br /><br /><br /><br /><br /><br />
    print(“Generating post image”)<br /><br /><br /><br /><br /><br />
    image_url = “”<br /><br /><br /><br /><br /><br />
    try:<br /><br /><br /><br /><br /><br />
        response = client.images.generate(<br /><br /><br /><br /><br /><br />
            model=”dall-e-3”,<br /><br /><br /><br /><br /><br />
            prompt=f”Create a professional and informative cover for a Customer Sentiment Analysis report. The image should feature a diverse range of emoticons, including happy, sad, and neutral faces, symbolizing different customer emotions. Include a large, digital-style bar graph in the center, illustrating varying levels of customer satisfaction, with each bar colored according to the sentiment it represents (green for positive, red for negative, yellow for neutral). The overall color scheme should be clean and professional, with a balance of bright and muted colors to convey a sense of data-driven analysis and insight. The image must not show any text. “,<br /><br /><br /><br /><br /><br />
            n=1,<br /><br /><br /><br /><br /><br />
            size=”1024×1024″<br /><br /><br /><br /><br /><br />
        )<br /><br /><br /><br /><br /><br />
        image_url = response.data[0].url</p><br /><br /><br /><br /><br />
<p>    except Exception as e:<br /><br /><br /><br /><br /><br />
        print(“An error occurred generating the image:”, str(e))</p><br /><br /><br /><br /><br />
<p>    return image_url

def generate_article_image(name):

print(“Generating post image”)

image_url = “”

try:

response = client.images.generate(

model=“dall-e-3”,

prompt=f“Create a professional and informative cover for a Customer Sentiment Analysis report. The image should feature a diverse range of emoticons, including happy, sad, and neutral faces, symbolizing different customer emotions. Include a large, digital-style bar graph in the center, illustrating varying levels of customer satisfaction, with each bar colored according to the sentiment it represents (green for positive, red for negative, yellow for neutral). The overall color scheme should be clean and professional, with a balance of bright and muted colors to convey a sense of data-driven analysis and insight. The image must not show any text. “,

n=1,

size=“1024×1024”

)

image_url = response.data[0].url

except Exception as e:

print(“An error occurred generating the image:”, str(e))

return image_url

Download image (add_image_from_base64)

Downloads an image from a URL and adds it to a Word document.

def add_image_from_base64(doc, image_url):<br /><br /><br /><br /><br /><br />
    response = requests.get(image_url)</p><br /><br /><br /><br /><br />
<p>    # Check if the request was successful<br /><br /><br /><br /><br /><br />
    if response.status_code == 200:<br /><br /><br /><br /><br /><br />
        image_stream = io.BytesIO(response.content)<br /><br /><br /><br /><br /><br />
        doc.add_picture(image_stream, width=docx.shared.Inches(6))<br /><br /><br /><br /><br /><br />
    else:<br /><br /><br /><br /><br /><br />
        print(f”Failed to download image. Status code: {response.status_code}”)

def add_image_from_base64(doc, image_url):

response = requests.get(image_url)

# Check if the request was successful

if response.status_code == 200:

image_stream = io.BytesIO(response.content)

doc.add_picture(image_stream, width=docx.shared.Inches(6))

else:

print(f“Failed to download image. Status code: {response.status_code}”)

Create Word doc (create_word_doc)

Assembles the various components into a formatted Word document.

def create_word_doc(file_name, title_text, image_url, product_details, intro, negative_report, positive_report, one_star_count, two_star_count, three_star_count, four_star_count, five_star_count):<br /><br /><br /><br /><br /><br />
    print(“Saving to word document”)</p><br /><br /><br /><br /><br />
<p>    total_count = one_star_count + two_star_count + three_star_count + four_star_count + five_star_count</p><br /><br /><br /><br /><br />
<p>    doc = docx.Document()</p><br /><br /><br /><br /><br />
<p>    # Add a title<br /><br /><br /><br /><br /><br />
    title = doc.add_paragraph()<br /><br /><br /><br /><br /><br />
    title.style = ‘Title'<br /><br /><br /><br /><br /><br />
    title_run = title.add_run(title_text)<br /><br /><br /><br /><br /><br />
    title_run.font.size = Pt(24)  # Set the font size<br /><br /><br /><br /><br /><br />
    title_run.font.name = ‘Arial (Body)’  # Set the font<br /><br /><br /><br /><br /><br />
    title.alignment = WD_ALIGN_PARAGRAPH.CENTER  # Center align the title</p><br /><br /><br /><br /><br />
<p>    if len(image_url) > 0:<br /><br /><br /><br /><br /><br />
        add_image_from_base64(doc, image_url)</p><br /><br /><br /><br /><br />
<p>    title = doc.add_paragraph(style=’Heading 1′)<br /><br /><br /><br /><br /><br />
    title_run = title.add_run(“Product Details”)<br /><br /><br /><br /><br /><br />
    title_run.font.name = ‘Arial (Body)’  # Set the font</p><br /><br /><br /><br /><br />
<p>    p = doc.add_paragraph()<br /><br /><br /><br /><br /><br />
    add_hyperlink(p, product_details[‘url’], product_details[‘name’])<br /><br /><br /><br /><br /><br />
    p.add_run(“\n* Price: {}\n* Total Reviews: {}\n* Aggregated Rating: {}\n* 1 Start Count: {}\n* 2 Start Count: {}\n* 3 Start Count: {}\n* 4 Start Count:{}\n* 5 Start Count: {}\n”.format(<br /><br /><br /><br /><br /><br />
        product_details[‘price’], product_details[‘review_count’], product_details[‘aggregated_rating’], one_star_count, two_star_count, three_star_count, four_star_count, five_star_count))</p><br /><br /><br /><br /><br />
<p>    title = doc.add_paragraph(style=’Heading 1′)<br /><br /><br /><br /><br /><br />
    title_run = title.add_run(“Introduction”)<br /><br /><br /><br /><br /><br />
    title_run.font.name = ‘Arial (Body)’  # Set the font</p><br /><br /><br /><br /><br />
<p>    doc.add_paragraph(intro)</p><br /><br /><br /><br /><br />
<p>    # Positive feedback<br /><br /><br /><br /><br /><br />
    title = doc.add_paragraph(style=’Heading 1′)<br /><br /><br /><br /><br /><br />
    title_run = title.add_run(“Positive Feedback Analysis”)<br /><br /><br /><br /><br /><br />
    positive_percentage = round(100 * ((four_star_count + five_star_count) / total_count))<br /><br /><br /><br /><br /><br />
    doc.add_paragraph(f”Around {positive_percentage}% gave the product a very positive feedback (four and five star rating). The following analysis summarizes the recurring themes and findings in the reviews for the product.  “)</p><br /><br /><br /><br /><br />
<p>    title_run.font.name = ‘Arial (Body)’  # Set the font<br /><br /><br /><br /><br /><br />
    html_to_word(doc, positive_report)</p><br /><br /><br /><br /><br />
<p>    # Negative feedback<br /><br /><br /><br /><br /><br />
    title = doc.add_paragraph(style=’Heading 1′)<br /><br /><br /><br /><br /><br />
    title_run = title.add_run(“Negative Feedback Analysis”)<br /><br /><br /><br /><br /><br />
    negative_percentage = round(100 * ((one_star_count + two_star_count) / total_count))<br /><br /><br /><br /><br /><br />
    doc.add_paragraph(f”Around {negative_percentage}% gave the product a very negative feedback (one and two star rating). The following analysis summarizes the recurring themes and findings in the reviews for the product.  “)<br /><br /><br /><br /><br /><br />
    title_run.font.name = ‘Arial (Body)’  # Set the font<br /><br /><br /><br /><br /><br />
    html_to_word(doc, negative_report)</p><br /><br /><br /><br /><br />
<p>    for paragraph in doc.paragraphs:<br /><br /><br /><br /><br /><br />
        for run in paragraph.runs:<br /><br /><br /><br /><br /><br />
            run.font.name = ‘Arial (Body)'</p><br /><br /><br /><br /><br />
<p>    # Save the document<br /><br /><br /><br /><br /><br />
    doc.save(file_name)

def create_word_doc(file_name, title_text, image_url, product_details, intro, negative_report, positive_report, one_star_count, two_star_count, three_star_count, four_star_count, five_star_count):

print(“Saving to word document”)

total_count = one_star_count + two_star_count + three_star_count + four_star_count + five_star_count

doc = docx.Document()

# Add a title

title = doc.add_paragraph()

title.style = ‘Title’

title_run = title.add_run(title_text)

title_run.font.size = Pt(24) # Set the font size

title_run.font.name = ‘Arial (Body)’ # Set the font

title.alignment = WD_ALIGN_PARAGRAPH.CENTER # Center align the title

if len(image_url) > 0:

add_image_from_base64(doc, image_url)

title = doc.add_paragraph(style=‘Heading 1’)

title_run = title.add_run(“Product Details”)

title_run.font.name = ‘Arial (Body)’ # Set the font

p = doc.add_paragraph()

add_hyperlink(p, product_details[‘url’], product_details[‘name’])

p.add_run(“\n* Price: {}\n* Total Reviews: {}\n* Aggregated Rating: {}\n* 1 Start Count: {}\n* 2 Start Count: {}\n* 3 Start Count: {}\n* 4 Start Count:{}\n* 5 Start Count: {}\n”.format(

product_details[‘price’], product_details[‘review_count’], product_details[‘aggregated_rating’], one_star_count, two_star_count, three_star_count, four_star_count, five_star_count))

title = doc.add_paragraph(style=‘Heading 1’)

title_run = title.add_run(“Introduction”)

title_run.font.name = ‘Arial (Body)’ # Set the font

doc.add_paragraph(intro)

# Positive feedback

title = doc.add_paragraph(style=‘Heading 1’)

title_run = title.add_run(“Positive Feedback Analysis”)

positive_percentage = round(100 * ((four_star_count + five_star_count) / total_count))

doc.add_paragraph(f“Around {positive_percentage}% gave the product a very positive feedback (four and five star rating). The following analysis summarizes the recurring themes and findings in the reviews for the product. “)

title_run.font.name = ‘Arial (Body)’ # Set the font

html_to_word(doc, positive_report)

# Negative feedback

title = doc.add_paragraph(style=‘Heading 1’)

title_run = title.add_run(“Negative Feedback Analysis”)

negative_percentage = round(100 * ((one_star_count + two_star_count) / total_count))

doc.add_paragraph(f“Around {negative_percentage}% gave the product a very negative feedback (one and two star rating). The following analysis summarizes the recurring themes and findings in the reviews for the product. “)

title_run.font.name = ‘Arial (Body)’ # Set the font

html_to_word(doc, negative_report)

for paragraph in doc.paragraphs:

for run in paragraph.runs:

run.font.name = ‘Arial (Body)’

# Save the document

doc.save(file_name)

AI + Python = a powerful automation tool

This script demonstrates an advanced use case of integrating AI-powered content generation with document automation in Python. It’s a comprehensive example of how combining various Python libraries with AI models can produce a powerful automation tool.

Download the example code and files:

The full Python script

import json<br /><br /><br /><br /><br /><br />
import glob<br /><br /><br /><br /><br /><br />
import docx<br /><br /><br /><br /><br /><br />
import requests<br /><br /><br /><br /><br /><br />
from openai import OpenAI<br /><br /><br /><br /><br /><br />
import os<br /><br /><br /><br /><br /><br />
import openai<br /><br /><br /><br /><br /><br />
from docx.shared import Pt<br /><br /><br /><br /><br /><br />
from bs4 import BeautifulSoup<br /><br /><br /><br /><br /><br />
import io<br /><br /><br /><br /><br /><br />
from docx.shared import Pt<br /><br /><br /><br /><br /><br />
from docx.enum.text import WD_ALIGN_PARAGRAPH<br /><br /><br /><br /><br /><br />
from docx.oxml.shared import OxmlElement, qn<br /><br /><br /><br /><br /><br />
from docx.opc.constants import RELATIONSHIP_TYPE</p><br /><br /><br /><br /><br />
<p>openai.api_key = os.getenv(“OPENAI_API_KEY”)<br /><br /><br /><br /><br /><br />
NUM_OF_REVIEWS = 50    </p><br /><br /><br /><br /><br />
<p>client = OpenAI()</p><br /><br /><br /><br /><br />
<p>def add_image_from_base64(doc, image_url):<br /><br /><br /><br /><br /><br />
    response = requests.get(image_url)</p><br /><br /><br /><br /><br />
<p>    # Check if the request was successful<br /><br /><br /><br /><br /><br />
    if response.status_code == 200:<br /><br /><br /><br /><br /><br />
        image_stream = io.BytesIO(response.content)<br /><br /><br /><br /><br /><br />
        doc.add_picture(image_stream, width=docx.shared.Inches(6))<br /><br /><br /><br /><br /><br />
    else:<br /><br /><br /><br /><br /><br />
        print(f”Failed to download image. Status code: {response.status_code}”)</p><br /><br /><br /><br /><br />
<p>def html_to_word(doc, html_content):<br /><br /><br /><br /><br /><br />
    soup = BeautifulSoup(html_content, ‘html.parser’)</p><br /><br /><br /><br /><br />
<p>    for element in soup.find_all([‘strong’, ‘ul’]):<br /><br /><br /><br /><br /><br />
        if element.name == ‘strong’:<br /><br /><br /><br /><br /><br />
            # Add bold text as a heading<br /><br /><br /><br /><br /><br />
            doc.add_paragraph(element.get_text().strip(), style=’Heading 2′)<br /><br /><br /><br /><br /><br />
        elif element.name == ‘ul’:<br /><br /><br /><br /><br /><br />
            for item in element.find_all(‘li’):<br /><br /><br /><br /><br /><br />
                # Add list items<br /><br /><br /><br /><br /><br />
                doc.add_paragraph(item.get_text().strip(), style=’List Bullet’)</p><br /><br /><br /><br /><br />
<p>def add_hyperlink(paragraph, url, text):<br /><br /><br /><br /><br /><br />
    “””<br /><br /><br /><br /><br /><br />
    A function that places a hyperlink within a paragraph object.</p><br /><br /><br /><br /><br />
<p>    :param paragraph: The paragraph we are adding the hyperlink to.<br /><br /><br /><br /><br /><br />
    :param url: The URL the link points to.<br /><br /><br /><br /><br /><br />
    :param text: The text displayed for the link.<br /><br /><br /><br /><br /><br />
    “””<br /><br /><br /><br /><br /><br />
    part = paragraph.part<br /><br /><br /><br /><br /><br />
    r_id = part.relate_to(url, RELATIONSHIP_TYPE.HYPERLINK, is_external=True)</p><br /><br /><br /><br /><br />
<p>    hyperlink = OxmlElement(‘w:hyperlink’)<br /><br /><br /><br /><br /><br />
    hyperlink.set(qn(‘r:id’), r_id)</p><br /><br /><br /><br /><br />
<p>    new_run = OxmlElement(‘w:r’)<br /><br /><br /><br /><br /><br />
    rPr = OxmlElement(‘w:rPr’)</p><br /><br /><br /><br /><br />
<p>    # Set the style for the hyperlink (color, underline)<br /><br /><br /><br /><br /><br />
    c = OxmlElement(‘w:color’)<br /><br /><br /><br /><br /><br />
    c.set(qn(‘w:val’), ‘0000FF’)  # Blue color<br /><br /><br /><br /><br /><br />
    rPr.append(c)<br /><br /><br /><br /><br /><br />
    u = OxmlElement(‘w:u’)<br /><br /><br /><br /><br /><br />
    u.set(qn(‘w:val’), ‘single’)<br /><br /><br /><br /><br /><br />
    rPr.append(u)</p><br /><br /><br /><br /><br />
<p>    new_run.append(rPr)<br /><br /><br /><br /><br /><br />
    new_run.text = text<br /><br /><br /><br /><br /><br />
    hyperlink.append(new_run)<br /><br /><br /><br /><br /><br />
    paragraph._p.append(hyperlink)</p><br /><br /><br /><br /><br />
<p>def insert_titles_in_text(text, reports):<br /><br /><br /><br /><br /><br />
    # Placeholder for inserting the titles<br /><br /><br /><br /><br /><br />
    placeholder = “[]”</p><br /><br /><br /><br /><br />
<p>    # Extracting the titles from the reports and formatting them with new lines<br /><br /><br /><br /><br /><br />
    titles = “\n”.join([report[‘title’] for report in reports])</p><br /><br /><br /><br /><br />
<p>    # Replacing the placeholder with the titles<br /><br /><br /><br /><br /><br />
    updated_text = text.replace(placeholder, titles)</p><br /><br /><br /><br /><br />
<p>    return updated_text</p><br /><br /><br /><br /><br />
<p>def generate_article_image(name):<br /><br /><br /><br /><br /><br />
    print(“Generating post image”)<br /><br /><br /><br /><br /><br />
    image_url = “”<br /><br /><br /><br /><br /><br />
    try:<br /><br /><br /><br /><br /><br />
        response = client.images.generate(<br /><br /><br /><br /><br /><br />
            model=”dall-e-3”,<br /><br /><br /><br /><br /><br />
            prompt=f”Create a professional and informative cover for a Customer Sentiment Analysis report. The image should feature a diverse range of emoticons, including happy, sad, and neutral faces, symbolizing different customer emotions. Include a large, digital-style bar graph in the center, illustrating varying levels of customer satisfaction, with each bar colored according to the sentiment it represents (green for positive, red for negative, yellow for neutral). The overall color scheme should be clean and professional, with a balance of bright and muted colors to convey a sense of data-driven analysis and insight. The image must not show any text. “,<br /><br /><br /><br /><br /><br />
            n=1,<br /><br /><br /><br /><br /><br />
            size=”1024×1024″<br /><br /><br /><br /><br /><br />
        )<br /><br /><br /><br /><br /><br />
        image_url = response.data[0].url</p><br /><br /><br /><br /><br />
<p>    except Exception as e:<br /><br /><br /><br /><br /><br />
        print(“An error occurred generating the image:”, str(e))</p><br /><br /><br /><br /><br />
<p>    return image_url</p><br /><br /><br /><br /><br />
<p>def call_gpt_completion(prompt):<br /><br /><br /><br /><br /><br />
    return client.chat.completions.create(<br /><br /><br /><br /><br /><br />
        model=”gpt-4-1106-preview”,<br /><br /><br /><br /><br /><br />
        max_tokens=4096,<br /><br /><br /><br /><br /><br />
        messages=[<br /><br /><br /><br /><br /><br />
            {“role”: “user”, “content”: prompt},<br /><br /><br /><br /><br /><br />
        ]<br /><br /><br /><br /><br /><br />
    )</p><br /><br /><br /><br /><br />
<p>def extract_points(reviews, sentiment):</p><br /><br /><br /><br /><br />
<p>    print(“Extract Points: ” + sentiment)</p><br /><br /><br /><br /><br />
<p>    points = []</p><br /><br /><br /><br /><br />
<p>    for review in reviews:</p><br /><br /><br /><br /><br />
<p>        review_text = review[‘title’] + “\n” + review[‘text’]<br /><br /><br /><br /><br /><br />
        prompt = f”The following is a {sentiment} review of a product, summarize in one bullet point the main {sentiment} feedback:\n{review_text}”<br /><br /><br /><br /><br /><br />
        summary = “”<br /><br /><br /><br /><br /><br />
        try:<br /><br /><br /><br /><br /><br />
            response = call_gpt_completion(prompt)</p><br /><br /><br /><br /><br />
<p>            for choice in response.choices:<br /><br /><br /><br /><br /><br />
                summary += choice.message.content<br /><br /><br /><br /><br /><br />
        except Exception as e:<br /><br /><br /><br /><br /><br />
            print(“An error occurred:”, str(e))</p><br /><br /><br /><br /><br />
<p>        points.append(summary)<br /><br /><br /><br /><br /><br />
        if len(points) == NUM_OF_REVIEWS:<br /><br /><br /><br /><br /><br />
            break</p><br /><br /><br /><br /><br />
<p>    return points</p><br /><br /><br /><br /><br />
<p>def generate_intro(product_name, product_description):<br /><br /><br /><br /><br /><br />
    print(“Generate post intro”)</p><br /><br /><br /><br /><br />
<p>    prompt = f”””<br /><br /><br /><br /><br /><br />
        Write a paragraph introducing a customer sentiment report about:<br /><br /><br /><br /><br /><br />
        Product name: {product_name}<br /><br /><br /><br /><br /><br />
        Product description: {product_description}</p><br /><br /><br /><br /><br />
<p>        The report is created automatically by using Webz.io eCommerce Reviews api and ChatGPT. The report is generated by calling the Webz.io eCommerce Reviews API for the reviews about {product_name}. It then splits the product reviews into positive and negative reviews. Following this step, it summarizes up to {NUM_OF_REVIEWS} reviews from both negative and positive reviews using ChatGPT to create a comprehensive list of posts. It then gives those lists to ChatGPT to create a comprehensive report highlighting both positive and negative feedback and provide a report based on the feedback.<br /><br /><br /><br /><br /><br />
        “””</p><br /><br /><br /><br /><br />
<p>    intro = “”<br /><br /><br /><br /><br /><br />
    try:<br /><br /><br /><br /><br /><br />
        response = call_gpt_completion(prompt)</p><br /><br /><br /><br /><br />
<p>        for choice in response.choices:<br /><br /><br /><br /><br /><br />
            intro += choice.message.content<br /><br /><br /><br /><br /><br />
    except Exception as e:<br /><br /><br /><br /><br /><br />
        print(“An error occurred:”, str(e))</p><br /><br /><br /><br /><br />
<p>    return intro</p><br /><br /><br /><br /><br />
<p>def generate_title(product_name):<br /><br /><br /><br /><br /><br />
    print(“Creating a title”)</p><br /><br /><br /><br /><br />
<p>    prompt = “Create a title for a customer sentiment report about the following product:\n” + product_name<br /><br /><br /><br /><br /><br />
    title_text = “”<br /><br /><br /><br /><br /><br />
    try:<br /><br /><br /><br /><br /><br />
        response = call_gpt_completion(prompt)</p><br /><br /><br /><br /><br />
<p>        for choice in response.choices:<br /><br /><br /><br /><br /><br />
            title_text += choice.message.content<br /><br /><br /><br /><br /><br />
    except Exception as e:<br /><br /><br /><br /><br /><br />
        print(“An error occurred:”, str(e))</p><br /><br /><br /><br /><br />
<p>    title_text = title_text.strip(” “).strip(‘\”‘)<br /><br /><br /><br /><br /><br />
    if title_text.startswith(“Title:”):  # Sometimes ChatGPT prefix the title with Title:<br /><br /><br /><br /><br /><br />
        return title_text[len(“Title:”):]</p><br /><br /><br /><br /><br />
<p>    return title_text</p><br /><br /><br /><br /><br />
<p>def create_negative_report(feedback, product_name):<br /><br /><br /><br /><br /><br />
    print(“Generating Negative Report”)</p><br /><br /><br /><br /><br />
<p>    prompt = f”””Create a customer sentiment analysis report that includes the following sections. Use  <UL> and <LI> tags for listing items and <strong> for the titles of each section.</p><br /><br /><br /><br /><br />
<p>            <HTML><br /><br /><br /><br /><br /><br />
            <strong>Analysis of Feedback</strong><br /><br /><br /><br /><br /><br />
            <UL><LI>Summarize recurring and common negative issues mentioned in the reviews.</LI></UL></p><br /><br /><br /><br /><br />
<p>            <strong>Recommendations</strong><br /><br /><br /><br /><br /><br />
            <UL><LI>Based on the analysis, suggest actionable measures the company can take to address the issues raised in the feedback.</LI></UL></p><br /><br /><br /><br /><br />
<p>            <strong>Conclusion</strong><br /><br /><br /><br /><br /><br />
            <UL><LI>Summarize the key findings of the report.</LI></UL></p><br /><br /><br /><br /><br />
<p>            </HTML></p><br /><br /><br /><br /><br />
<p>                The following is the list of the negative feedback about the product you will base your report on:</p><br /><br /><br /><br /><br />
<p>                {feedback}</p><br /><br /><br /><br /><br />
<p>                    “””</p><br /><br /><br /><br /><br />
<p>    try:<br /><br /><br /><br /><br /><br />
        response = call_gpt_completion(prompt)</p><br /><br /><br /><br /><br />
<p>        report = “”</p><br /><br /><br /><br /><br />
<p>        for choice in response.choices:<br /><br /><br /><br /><br /><br />
            report += choice.message.content</p><br /><br /><br /><br /><br />
<p>    except Exception as e:<br /><br /><br /><br /><br /><br />
        print(“An error occurred:”, str(e))</p><br /><br /><br /><br /><br />
<p>    return report</p><br /><br /><br /><br /><br />
<p>def create_positive_report(feedback, product_name):<br /><br /><br /><br /><br /><br />
    print(“Generating Positive Report”)</p><br /><br /><br /><br /><br />
<p>    prompt = f”””Create a customer sentiment analysis report that includes the following sections. Use  <UL> and <LI> tags for listing items and <strong> for the titles of each section. </p><br /><br /><br /><br /><br />
<p>                <HTML><br /><br /><br /><br /><br /><br />
                <strong>Detailed Analysis</strong><br /><br /><br /><br /><br /><br />
                <UL><LI>Summarize common positive feedback mentioned in the reviews.</LI></UL></p><br /><br /><br /><br /><br />
<p>                <strong>Recommendations </strong><br /><br /><br /><br /><br /><br />
                <UL><LI>Propose marketing strategies that leverage the positive aspects highlighted in the reviews. </LI></UL></p><br /><br /><br /><br /><br />
<p>                <strong>Conclusion </strong><br /><br /><br /><br /><br /><br />
                <UL><LI>Summarize the key findings and the overall sentiment of the customers towards the product.</LI></UL></p><br /><br /><br /><br /><br />
<p>                </HTML></p><br /><br /><br /><br /><br />
<p>                The following is the list of the positive feedback about the product you will base your report on:</p><br /><br /><br /><br /><br />
<p>                {feedback}</p><br /><br /><br /><br /><br />
<p>                “””</p><br /><br /><br /><br /><br />
<p>    try:<br /><br /><br /><br /><br /><br />
        response = call_gpt_completion(prompt)</p><br /><br /><br /><br /><br />
<p>        report = “”</p><br /><br /><br /><br /><br />
<p>        for choice in response.choices:<br /><br /><br /><br /><br /><br />
            report += choice.message.content</p><br /><br /><br /><br /><br />
<p>    except Exception as e:<br /><br /><br /><br /><br /><br />
        print(“An error occurred:”, str(e))</p><br /><br /><br /><br /><br />
<p>    return report</p><br /><br /><br /><br /><br />
<p>def create_word_doc(file_name, title_text, image_url, product_details, intro, negative_report, positive_report, one_star_count, two_star_count, three_star_count, four_star_count, five_star_count):<br /><br /><br /><br /><br /><br />
    print(“Saving to word document”)</p><br /><br /><br /><br /><br />
<p>    total_count = one_star_count + two_star_count + three_star_count + four_star_count + five_star_count</p><br /><br /><br /><br /><br />
<p>    doc = docx.Document()</p><br /><br /><br /><br /><br />
<p>    # Add a title<br /><br /><br /><br /><br /><br />
    title = doc.add_paragraph()<br /><br /><br /><br /><br /><br />
    title.style = ‘Title'<br /><br /><br /><br /><br /><br />
    title_run = title.add_run(title_text)<br /><br /><br /><br /><br /><br />
    title_run.font.size = Pt(24)  # Set the font size<br /><br /><br /><br /><br /><br />
    title_run.font.name = ‘Arial (Body)’  # Set the font<br /><br /><br /><br /><br /><br />
    title.alignment = WD_ALIGN_PARAGRAPH.CENTER  # Center align the title</p><br /><br /><br /><br /><br />
<p>    if len(image_url) > 0:<br /><br /><br /><br /><br /><br />
        add_image_from_base64(doc, image_url)</p><br /><br /><br /><br /><br />
<p>    title = doc.add_paragraph(style=’Heading 1′)<br /><br /><br /><br /><br /><br />
    title_run = title.add_run(“Product Details”)<br /><br /><br /><br /><br /><br />
    title_run.font.name = ‘Arial (Body)’  # Set the font</p><br /><br /><br /><br /><br />
<p>    p = doc.add_paragraph()<br /><br /><br /><br /><br /><br />
    add_hyperlink(p, product_details[‘url’], product_details[‘name’])<br /><br /><br /><br /><br /><br />
    p.add_run(“\n* Price: {}\n* Total Reviews: {}\n* Aggregated Rating: {}\n* 1 Start Count: {}\n* 2 Start Count: {}\n* 3 Start Count: {}\n* 4 Start Count:{}\n* 5 Start Count: {}\n”.format(<br /><br /><br /><br /><br /><br />
        product_details[‘price’], product_details[‘review_count’], product_details[‘aggregated_rating’], one_star_count, two_star_count, three_star_count, four_star_count, five_star_count))</p><br /><br /><br /><br /><br />
<p>    title = doc.add_paragraph(style=’Heading 1′)<br /><br /><br /><br /><br /><br />
    title_run = title.add_run(“Introduction”)<br /><br /><br /><br /><br /><br />
    title_run.font.name = ‘Arial (Body)’  # Set the font</p><br /><br /><br /><br /><br />
<p>    doc.add_paragraph(intro)</p><br /><br /><br /><br /><br />
<p>    # Positive feedback<br /><br /><br /><br /><br /><br />
    title = doc.add_paragraph(style=’Heading 1′)<br /><br /><br /><br /><br /><br />
    title_run = title.add_run(“Positive Feedback Analysis”)<br /><br /><br /><br /><br /><br />
    positive_percentage = round(100 * ((four_star_count + five_star_count) / total_count))<br /><br /><br /><br /><br /><br />
    doc.add_paragraph(f”Around {positive_percentage}% gave the product a very positive feedback (four and five star rating). The following analysis summarizes the recurring themes and findings in the reviews for the product.  “)</p><br /><br /><br /><br /><br />
<p>    title_run.font.name = ‘Arial (Body)’  # Set the font<br /><br /><br /><br /><br /><br />
    html_to_word(doc, positive_report)</p><br /><br /><br /><br /><br />
<p>    # Negative feedback<br /><br /><br /><br /><br /><br />
    title = doc.add_paragraph(style=’Heading 1′)<br /><br /><br /><br /><br /><br />
    title_run = title.add_run(“Negative Feedback Analysis”)<br /><br /><br /><br /><br /><br />
    negative_percentage = round(100 * ((one_star_count + two_star_count) / total_count))<br /><br /><br /><br /><br /><br />
    doc.add_paragraph(f”Around {negative_percentage}% gave the product a very negative feedback (one and two star rating). The following analysis summarizes the recurring themes and findings in the reviews for the product.  “)<br /><br /><br /><br /><br /><br />
    title_run.font.name = ‘Arial (Body)’  # Set the font<br /><br /><br /><br /><br /><br />
    html_to_word(doc, negative_report)</p><br /><br /><br /><br /><br />
<p>    for paragraph in doc.paragraphs:<br /><br /><br /><br /><br /><br />
        for run in paragraph.runs:<br /><br /><br /><br /><br /><br />
            run.font.name = ‘Arial (Body)'</p><br /><br /><br /><br /><br />
<p>    # Save the document<br /><br /><br /><br /><br /><br />
    doc.save(file_name)</p><br /><br /><br /><br /><br />
<p>def read_ndjson_file(file_path):<br /><br /><br /><br /><br /><br />
    “””Reads an ndjson file and returns the content as a list of dictionaries.”””<br /><br /><br /><br /><br /><br />
    with open(file_path, ‘r’, encoding=’utf-8′) as file:<br /><br /><br /><br /><br /><br />
        return [json.loads(line) for line in file]</p><br /><br /><br /><br /><br />
<p>def main():<br /><br /><br /><br /><br /><br />
    # Path to the reviews folder<br /><br /><br /><br /><br /><br />
    reviews_folder = ‘reviews'</p><br /><br /><br /><br /><br />
<p>    # Read product information<br /><br /><br /><br /><br /><br />
    product_info = read_ndjson_file(os.path.join(reviews_folder, ‘Products.ndjson’))[0]</p><br /><br /><br /><br /><br />
<p>    # Initialize an empty list to store all reviews<br /><br /><br /><br /><br /><br />
    positive_reviews = []<br /><br /><br /><br /><br /><br />
    negative_reviews = []</p><br /><br /><br /><br /><br />
<p>    # Counters for each star rating<br /><br /><br /><br /><br /><br />
    one_star_count = 0<br /><br /><br /><br /><br /><br />
    two_star_count = 0<br /><br /><br /><br /><br /><br />
    three_star_count = 0<br /><br /><br /><br /><br /><br />
    four_star_count = 0<br /><br /><br /><br /><br /><br />
    five_star_count = 0</p><br /><br /><br /><br /><br />
<p>    # Read all reviews files in the reviews folder and add those with substantial text into separate lists<br /><br /><br /><br /><br /><br />
    for review_file in glob.glob(os.path.join(reviews_folder, ‘Reviews_*.ndjson’)):<br /><br /><br /><br /><br /><br />
        reviews = read_ndjson_file(review_file)<br /><br /><br /><br /><br /><br />
        for review in reviews:</p><br /><br /><br /><br /><br />
<p>            if review[‘rating’] == 1:<br /><br /><br /><br /><br /><br />
                one_star_count += 1<br /><br /><br /><br /><br /><br />
            elif review[‘rating’] == 2:<br /><br /><br /><br /><br /><br />
                two_star_count += 1<br /><br /><br /><br /><br /><br />
            elif review[‘rating’] == 3:<br /><br /><br /><br /><br /><br />
                three_star_count += 1<br /><br /><br /><br /><br /><br />
            elif review[‘rating’] == 4:<br /><br /><br /><br /><br /><br />
                four_star_count += 1<br /><br /><br /><br /><br /><br />
            elif review[‘rating’] == 5:<br /><br /><br /><br /><br /><br />
                five_star_count += 1</p><br /><br /><br /><br /><br />
<p>            if len(review[‘text’]) > 100:<br /><br /><br /><br /><br /><br />
                if review[‘rating’] <3: #  1-2 stars rating is negative<br /><br /><br /><br /><br /><br />
                    negative_reviews.append(review)<br /><br /><br /><br /><br /><br />
                if review[‘rating’] >3: # 4-5 starts reating is positive<br /><br /><br /><br /><br /><br />
                    positive_reviews.append(review)</p><br /><br /><br /><br /><br />
<p>    image_url = generate_article_image(product_info[‘name’])<br /><br /><br /><br /><br /><br />
    title = generate_title(product_info[‘name’])<br /><br /><br /><br /><br /><br />
    intro = generate_intro(product_info[‘name’], product_info[‘description’])</p><br /><br /><br /><br /><br />
<p>    positive_bullet_points = “\n”.join(extract_points(positive_reviews, ‘positive’))<br /><br /><br /><br /><br /><br />
    negative_bullet_points = “\n”.join(extract_points(negative_reviews, ‘negative’))</p><br /><br /><br /><br /><br />
<p>    positive_report = create_positive_report(positive_bullet_points, product_info[‘name’])<br /><br /><br /><br /><br /><br />
    negative_report = create_negative_report(negative_bullet_points, product_info[‘name’])</p><br /><br /><br /><br /><br />
<p>    create_word_doc(“customer sentiment analysis report.docx”, title, image_url, product_info , intro, negative_report, positive_report,<br /><br /><br /><br /><br /><br />
                    one_star_count, two_star_count, three_star_count, four_star_count, five_star_count)</p><br /><br /><br /><br /><br />
<p>    print(“done”)</p><br /><br /><br /><br /><br />
<p>if __name__ == “__main__”:<br /><br /><br /><br /><br /><br />
    main()

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

207

208

209

210

211

212

213

214

215

216

217

218

219

220

221

222

223

224

225

226

227

228

229

230

231

232

233

234

235

236

237

238

239

240

241

242

243

244

245

246

247

248

249

250

251

252

253

254

255

256

257

258

259

260

261

262

263

264

265

266

267

268

269

270

271

272

273

274

275

276

277

278

279

280

281

282

283

284

285

286

287

288

289

290

291

292

293

294

295

296

297

298

299

300

301

302

303

304

305

306

307

308

309

310

311

312

313

314

315

316

317

318

319

320

321

322

323

324

325

326

327

328

329

330

331

332

333

334

335

336

337

338

339

340

341

342

343

344

345

346

347

348

349

350

351

352

353

354

355

356

357

358

359

360

361

362

363

364

365

366

367

368

369

370

371

372

373

374

375

376

377

378

379

380

381

382

383

384

385

386

387

388

389

390

391

392

393

394

395

396

397

398

399

400

401

402

403

404

405

406

407

import json

import glob

import docx

import requests

from openai import OpenAI

import os

import openai

from docx.shared import Pt

from bs4 import BeautifulSoup

import io

from docx.shared import Pt

from docx.enum.text import WD_ALIGN_PARAGRAPH

from docx.oxml.shared import OxmlElement, qn

from docx.opc.constants import RELATIONSHIP_TYPE

openai.api_key = os.getenv(“OPENAI_API_KEY”)

NUM_OF_REVIEWS = 50

client = OpenAI()

def add_image_from_base64(doc, image_url):

response = requests.get(image_url)

# Check if the request was successful

if response.status_code == 200:

image_stream = io.BytesIO(response.content)

doc.add_picture(image_stream, width=docx.shared.Inches(6))

else:

print(f“Failed to download image. Status code: {response.status_code}”)

def html_to_word(doc, html_content):

soup = BeautifulSoup(html_content, ‘html.parser’)

for element in soup.find_all([‘strong’, ‘ul’]):

if element.name == ‘strong’:

# Add bold text as a heading

doc.add_paragraph(element.get_text().strip(), style=‘Heading 2’)

elif element.name == ‘ul’:

for item in element.find_all(‘li’):

# Add list items

doc.add_paragraph(item.get_text().strip(), style=‘List Bullet’)

def add_hyperlink(paragraph, url, text):

“””

A function that places a hyperlink within a paragraph object.

:param paragraph: The paragraph we are adding the hyperlink to.

:param url: The URL the link points to.

:param text: The text displayed for the link.

“””

part = paragraph.part

r_id = part.relate_to(url, RELATIONSHIP_TYPE.HYPERLINK, is_external=True)

hyperlink = OxmlElement(‘w:hyperlink’)

hyperlink.set(qn(‘r:id’), r_id)

new_run = OxmlElement(‘w:r’)

rPr = OxmlElement(‘w:rPr’)

# Set the style for the hyperlink (color, underline)

c = OxmlElement(‘w:color’)

c.set(qn(‘w:val’), ‘0000FF’) # Blue color

rPr.append(c)

u = OxmlElement(‘w:u’)

u.set(qn(‘w:val’), ‘single’)

rPr.append(u)

new_run.append(rPr)

new_run.text = text

hyperlink.append(new_run)

paragraph._p.append(hyperlink)

def insert_titles_in_text(text, reports):

# Placeholder for inserting the titles

placeholder = “[]”

# Extracting the titles from the reports and formatting them with new lines

titles = “\n”.join([report[‘title’] for report in reports])

# Replacing the placeholder with the titles

updated_text = text.replace(placeholder, titles)

return updated_text

def generate_article_image(name):

print(“Generating post image”)

image_url = “”

try:

response = client.images.generate(

model=“dall-e-3”,

n=1,

size=“1024×1024”

)

image_url = response.data[0].url

except Exception as e:

print(“An error occurred generating the image:”, str(e))

return image_url

def call_gpt_completion(prompt):

return client.chat.completions.create(

model=“gpt-4-1106-preview”,

max_tokens=4096,

messages=[

{“role”: “user”, “content”: prompt},

]

)

def extract_points(reviews, sentiment):

print(“Extract Points: “ + sentiment)

points = []

for review in reviews:

review_text = review[‘title’] + “\n” + review[‘text’]

prompt = f“The following is a {sentiment} review of a product, summarize in one bullet point the main {sentiment} feedback:\n{review_text}”

summary = “”

try:

response = call_gpt_completion(prompt)

for choice in response.choices:

summary += choice.message.content

except Exception as e:

print(“An error occurred:”, str(e))

points.append(summary)

if len(points) == NUM_OF_REVIEWS:

break

return points

def generate_intro(product_name, product_description):

print(“Generate post intro”)

prompt = f“””

Write a paragraph introducing a customer sentiment report about:

Product name: {product_name}

Product description: {product_description}

The report is created automatically by using Webz.io eCommerce Reviews api and ChatGPT. The report is generated by calling the Webz.io eCommerce Reviews API for the reviews about {product_name}. It then splits the product reviews into positive and negative reviews. Following this step, it summarizes up to {NUM_OF_REVIEWS} reviews from both negative and positive reviews using ChatGPT to create a comprehensive list of posts. It then gives those lists to ChatGPT to create a comprehensive report highlighting both positive and negative feedback and provide a report based on the feedback.

“””

intro = “”

try:

response = call_gpt_completion(prompt)

for choice in response.choices:

intro += choice.message.content

except Exception as e:

print(“An error occurred:”, str(e))

return intro

def generate_title(product_name):

print(“Creating a title”)

prompt = “Create a title for a customer sentiment report about the following product:\n” + product_name

title_text = “”

try:

response = call_gpt_completion(prompt)

for choice in response.choices:

title_text += choice.message.content

except Exception as e:

print(“An error occurred:”, str(e))

title_text = title_text.strip(” “).strip(‘\”‘)

if title_text.startswith(“Title:”): # Sometimes ChatGPT prefix the title with Title:

return title_text[len(“Title:”):]

return title_text

def create_negative_report(feedback, product_name):

print(“Generating Negative Report”)

prompt = f“””Create a customer sentiment analysis report that includes the following sections. Use <UL> and <LI> tags for listing items and for the titles of each section.

<HTML>

Analysis of Feedback

<UL><LI>Summarize recurring and common negative issues mentioned in the reviews.</LI></UL>

Recommendations

<UL><LI>Based on the analysis, suggest actionable measures the company can take to address the issues raised in the feedback.</LI></UL>

Conclusion

<UL><LI>Summarize the key findings of the report.</LI></UL>

</HTML>

The following is the list of the negative feedback about the product you will base your report on:

{feedback}

“””

try:

response = call_gpt_completion(prompt)

report = “”

for choice in response.choices:

report += choice.message.content

except Exception as e:

print(“An error occurred:”, str(e))

return report

def create_positive_report(feedback, product_name):

print(“Generating Positive Report”)

prompt = f“””Create a customer sentiment analysis report that includes the following sections. Use <UL> and <LI> tags for listing items and for the titles of each section.

<HTML>

Detailed Analysis

<UL><LI>Summarize common positive feedback mentioned in the reviews.</LI></UL>

Recommendations

<UL><LI>Propose marketing strategies that leverage the positive aspects highlighted in the reviews. </LI></UL>

Conclusion

<UL><LI>Summarize the key findings and the overall sentiment of the customers towards the product.</LI></UL>

</HTML>

The following is the list of the positive feedback about the product you will base your report on:

{feedback}

“””

try:

response = call_gpt_completion(prompt)

report = “”

for choice in response.choices:

report += choice.message.content

except Exception as e:

print(“An error occurred:”, str(e))

return report

def create_word_doc(file_name, title_text, image_url, product_details, intro, negative_report, positive_report, one_star_count, two_star_count, three_star_count, four_star_count, five_star_count):

print(“Saving to word document”)

total_count = one_star_count + two_star_count + three_star_count + four_star_count + five_star_count

doc = docx.Document()

# Add a title

title = doc.add_paragraph()

title.style = ‘Title’

title_run = title.add_run(title_text)

title_run.font.size = Pt(24) # Set the font size

title_run.font.name = ‘Arial (Body)’ # Set the font

title.alignment = WD_ALIGN_PARAGRAPH.CENTER # Center align the title

if len(image_url) > 0:

add_image_from_base64(doc, image_url)

title = doc.add_paragraph(style=‘Heading 1’)

title_run = title.add_run(“Product Details”)

title_run.font.name = ‘Arial (Body)’ # Set the font

p = doc.add_paragraph()

add_hyperlink(p, product_details[‘url’], product_details[‘name’])

p.add_run(“\n* Price: {}\n* Total Reviews: {}\n* Aggregated Rating: {}\n* 1 Start Count: {}\n* 2 Start Count: {}\n* 3 Start Count: {}\n* 4 Start Count:{}\n* 5 Start Count: {}\n”.format(

product_details[‘price’], product_details[‘review_count’], product_details[‘aggregated_rating’], one_star_count, two_star_count, three_star_count, four_star_count, five_star_count))

title = doc.add_paragraph(style=‘Heading 1’)

title_run = title.add_run(“Introduction”)

title_run.font.name = ‘Arial (Body)’ # Set the font

doc.add_paragraph(intro)

# Positive feedback

title = doc.add_paragraph(style=‘Heading 1’)

title_run = title.add_run(“Positive Feedback Analysis”)

positive_percentage = round(100 * ((four_star_count + five_star_count) / total_count))

title_run.font.name = ‘Arial (Body)’ # Set the font

html_to_word(doc, positive_report)

# Negative feedback

title = doc.add_paragraph(style=‘Heading 1’)

title_run = title.add_run(“Negative Feedback Analysis”)

negative_percentage = round(100 * ((one_star_count + two_star_count) / total_count))

title_run.font.name = ‘Arial (Body)’ # Set the font

html_to_word(doc, negative_report)

for paragraph in doc.paragraphs:

for run in paragraph.runs:

run.font.name = ‘Arial (Body)’

# Save the document

doc.save(file_name)

def read_ndjson_file(file_path):

“””Reads an ndjson file and returns the content as a list of dictionaries.”””

with open(file_path, ‘r’, encoding=‘utf-8’) as file:

return [json.loads(line) for line in file]

def main():

# Path to the reviews folder

reviews_folder = ‘reviews’

# Read product information

product_info = read_ndjson_file(os.path.join(reviews_folder, ‘Products.ndjson’))[0]

# Initialize an empty list to store all reviews

positive_reviews = []

negative_reviews = []

# Counters for each star rating

one_star_count = 0

two_star_count = 0

three_star_count = 0

four_star_count = 0

five_star_count = 0

# Read all reviews files in the reviews folder and add those with substantial text into separate lists

for review_file in glob.glob(os.path.join(reviews_folder, ‘Reviews_*.ndjson’)):

reviews = read_ndjson_file(review_file)

for review in reviews:

if review[‘rating’] == 1:

one_star_count += 1

elif review[‘rating’] == 2:

two_star_count += 1

elif review[‘rating’] == 3:

three_star_count += 1

elif review[‘rating’] == 4:

four_star_count += 1

elif review[‘rating’] == 5:

five_star_count += 1

if len(review[‘text’]) > 100:

if review[‘rating’] <3: # 1-2 stars rating is negative

negative_reviews.append(review)

if review[‘rating’] >3: # 4-5 starts reating is positive

positive_reviews.append(review)

image_url = generate_article_image(product_info[‘name’])

title = generate_title(product_info[‘name’])

intro = generate_intro(product_info[‘name’], product_info[‘description’])

positive_bullet_points = “\n”.join(extract_points(positive_reviews, ‘positive’))

negative_bullet_points = “\n”.join(extract_points(negative_reviews, ‘negative’))

positive_report = create_positive_report(positive_bullet_points, product_info[‘name’])

negative_report = create_negative_report(negative_bullet_points, product_info[‘name’])

create_word_doc(“customer sentiment analysis report.docx”, title, image_url, product_info , intro, negative_report, positive_report,

one_star_count, two_star_count, three_star_count, four_star_count, five_star_count)

print(“done”)

if __name__ == “__main__”:

main()

Free sample data (NDJSON format): Product dataset, Review dataset #1, Review dataset #2, Review dataset #3, Review dataset #4
Example of an auto-generated report in PDF format.

To run the script:

Ensure that Python and the required Python libraries are installed on your machine.=
Set your OpenAI API key in your development environment.
Place the NDJSON file with the product and review information in the specified directory.
Run the script.

Ready to automate customer sentiment analysis for your organization? Talk to one of our experts today.

Ben

Spread the news

How to Automate Customer Sentiment Analysis Reports: A Guide for Developers

What you’ll need to run the script

Automating customer sentiment analysis reports: script breakdown

Import files, packages, and modules

Set global variable and access API key

Orchestrate entire process (main)

Define functions

Read NDJSON file (read_ndjson_file)

Send prompt (call_gpt_completion)

Extract points (extract_points)

Generate title (generate_title)

Generate introduction (generate_intro)

Generate negative report (create_negative_report)

Generate positive report (create_positive_report)

HTML to formatted text (html_to_word)

Add hyperlink (add_hyperlink)

Add title placeholder (insert_titles_in_text)

Generate image (generate_article_image)

Download image (add_image_from_base64)

Create Word doc (create_word_doc)

AI + Python = a powerful automation tool

Download the example code and files:

To run the script:

Ben

Subscribe to our blog for more news and updates!

Power Your Insights with Data You Can Trust

Ready to Explore Web Data at Scale?