NewsSummarizer

NewsSummarizer reads article URLs from an Excel file, scrapes the article content, generates a consolidated AI news brief using a local Ollama model, and emails the final summary.

Features

Reads news links from NewsLinks.xlsx
Scrapes article content using crawl4ai
Summarizes all stories with a local Ollama model (llama3.2:3b by default)
Produces a markdown-style newsletter summary with inline source links
Sends the summary by email (plain text + HTML)
Falls back to printing the summary to the console if email credentials are missing

Project Requirements

System

Python 3.10+
Ollama installed and running locally
Chromium browser support for crawler runtime (playwright install)

Python Packages

Install required packages:

pip install langgraph langchain_ollama langchain-core crawl4ai pandas openpyxl python-dotenv

Install Playwright browser dependencies:

playwright install

Ollama Model Setup

Start Ollama and pull the model used by default:

ollama pull llama3.2:3b

You can override the model with OLLAMA_MODEL in .env.

Configuration

Create a .env file in the project root:

EMAIL_ADDRESS=your_email@gmail.com
EMAIL_PASSWORD=your_gmail_app_password
RECIPIENT_EMAIL=recipient@gmail.com
OLLAMA_MODEL=llama3.2:3b

Notes:

EMAIL_PASSWORD should be a Gmail App Password, not your account password.
If email variables are missing, the script prints the summary to console instead of sending mail.
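
The fallback rule in the notes can be sketched as follows. This is a minimal illustration rather than the project's actual code: the helper names load_email_settings and load_model_name and the returned dictionary shape are assumptions; only the variable names and the default model come from the .env example above.

```python
import os

# Variable names taken from the .env example; the helpers are hypothetical.
REQUIRED_EMAIL_VARS = ("EMAIL_ADDRESS", "EMAIL_PASSWORD", "RECIPIENT_EMAIL")
DEFAULT_MODEL = "llama3.2:3b"


def load_email_settings(env=None):
    """Return the email settings, or None if any required variable is missing or empty."""
    env = os.environ if env is None else env
    values = {name: (env.get(name) or "").strip() for name in REQUIRED_EMAIL_VARS}
    if not all(values.values()):
        return None  # caller should print the summary instead of emailing it
    return values


def load_model_name(env=None):
    """Return OLLAMA_MODEL if it is set, otherwise the documented default."""
    env = os.environ if env is None else env
    return (env.get("OLLAMA_MODEL") or "").strip() or DEFAULT_MODEL
```

With python-dotenv installed, calling load_dotenv() before these helpers copies the values from .env into os.environ.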

Input File Format

The script expects an Excel file named NewsLinks.xlsx in the project root with a column named exactly:

URL

Example:

URL
https://example.com/article-1
https://example.com/article-2
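
One way to read and validate this file is sketched below. The function names clean_urls and read_urls are hypothetical, and the filtering rules (http/https only, duplicates dropped) are assumptions, not documented behavior of the script.

```python
from urllib.parse import urlparse


def clean_urls(values):
    """Keep well-formed http(s) URLs, dropping blanks and duplicates in order."""
    seen = set()
    urls = []
    for value in values:
        text = str(value).strip()
        parsed = urlparse(text)
        if parsed.scheme in ("http", "https") and parsed.netloc and text not in seen:
            seen.add(text)
            urls.append(text)
    return urls


def read_urls(path="NewsLinks.xlsx"):
    """Read the URL column with pandas (needs pandas and openpyxl installed)."""
    import pandas as pd  # imported lazily so clean_urls works without pandas

    frame = pd.read_excel(path)
    if "URL" not in frame.columns:
        raise ValueError("NewsLinks.xlsx must contain a column named exactly 'URL'")
    return clean_urls(frame["URL"].dropna())
```

Checking the column name up front turns a missing URL column into a clear error message instead of a KeyError later in the run.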

How It Works (End-to-End)

Pipeline implemented in Agent.py:

Read URLs from NewsLinks.xlsx (read_excel_node)
Scrape each URL and extract text (scrape_links_node)
Summarize all scraped content with Ollama (summarize_node)
Email summary as newsletter (send_email_node)

The flow is orchestrated using LangGraph:

START -> read_excel -> scrape -> summarize -> email -> END
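
The linear flow above can be wired in LangGraph roughly as shown here. This is a sketch under assumptions: the node bodies are placeholders, and the BriefState fields (urls, articles, summary, delivered) are hypothetical; only the node function names and the edge order come from this README.

```python
from typing import TypedDict


class BriefState(TypedDict, total=False):
    # Hypothetical state shape; Agent.py may use different keys.
    urls: list[str]
    articles: list[str]
    summary: str
    delivered: bool


# Placeholder nodes: each returns only the state keys it updates.
def read_excel_node(state: BriefState) -> dict:
    return {"urls": ["https://example.com/article-1"]}


def scrape_links_node(state: BriefState) -> dict:
    return {"articles": [f"text of {url}" for url in state["urls"]]}


def summarize_node(state: BriefState) -> dict:
    return {"summary": f"{len(state['articles'])} article(s) summarized"}


def send_email_node(state: BriefState) -> dict:
    print(state["summary"])  # placeholder: prints instead of sending mail
    return {"delivered": False}


def build_graph():
    """Wire the four nodes in a straight line (requires langgraph)."""
    from langgraph.graph import END, START, StateGraph

    graph = StateGraph(BriefState)
    graph.add_node("read_excel", read_excel_node)
    graph.add_node("scrape", scrape_links_node)
    graph.add_node("summarize", summarize_node)
    graph.add_node("email", send_email_node)
    graph.add_edge(START, "read_excel")
    graph.add_edge("read_excel", "scrape")
    graph.add_edge("scrape", "summarize")
    graph.add_edge("summarize", "email")
    graph.add_edge("email", END)
    return graph.compile()
```

With langgraph installed, a call such as build_graph().invoke({"urls": []}) would run the four placeholder nodes in order and return the final state.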

Running the Project

From project root:

python Agent.py

Output

Success path: the summary is emailed to RECIPIENT_EMAIL
Partial scrape failures: failed URLs are skipped; articles that scraped successfully are still summarized
No successful scrapes: execution stops with an error
Email send failure: the summary is printed to the terminal
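
The email outcomes listed above could be implemented along these lines. This is a hedged sketch, not the project's code: the function names, the "AI News Brief" subject, the settings dictionary keyed by the .env variable names, and the use of Gmail's SMTP-over-SSL endpoint on port 465 are all assumptions.

```python
import smtplib
from email.message import EmailMessage


def build_message(summary, html, sender, recipient, subject="AI News Brief"):
    """Build a multipart/alternative email: plain text first, HTML second."""
    message = EmailMessage()
    message["Subject"] = subject
    message["From"] = sender
    message["To"] = recipient
    message.set_content(summary)
    message.add_alternative(html, subtype="html")
    return message


def send_via_gmail(message, sender, password):
    """Send over Gmail's SMTP-over-SSL endpoint (assumed transport)."""
    with smtplib.SMTP_SSL("smtp.gmail.com", 465) as server:
        server.login(sender, password)
        server.send_message(message)


def deliver(summary, html, settings, send=send_via_gmail):
    """Email the summary; print it instead if settings are missing or sending fails."""
    if settings:
        message = build_message(
            summary, html, settings["EMAIL_ADDRESS"], settings["RECIPIENT_EMAIL"]
        )
        try:
            send(message, settings["EMAIL_ADDRESS"], settings["EMAIL_PASSWORD"])
            return "emailed"
        except (smtplib.SMTPException, OSError) as error:
            print(f"Email failed ({error}); printing summary instead.")
    print(summary)
    return "printed"
```

Passing the sender as a parameter keeps the fallback logic testable without a network connection or real credentials.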

Troubleshooting

ModuleNotFoundError: install missing package(s) with pip install ...
Playwright/browser issues: run playwright install
Ollama connection/model errors:
- ensure Ollama daemon is running
- ensure model exists (ollama pull llama3.2:3b)
Excel errors:
- verify file is named NewsLinks.xlsx
- verify URL column exists and contains valid links
Gmail auth errors:
- confirm App Password is used
- confirm account allows app password usage
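
For the Ollama checks above, a small preflight script can confirm that the daemon is reachable and the model is present. This sketch assumes Ollama's default local address (http://localhost:11434) and its /api/tags endpoint, which lists locally available models; the function names are hypothetical.

```python
import json
from urllib.request import urlopen


def installed_models(tags_payload):
    """Extract model names from the JSON body returned by Ollama's /api/tags."""
    return [entry["name"] for entry in tags_payload.get("models", [])]


def check_model(model="llama3.2:3b", base_url="http://localhost:11434"):
    """Return True if the daemon lists the model; raises OSError if it is unreachable."""
    with urlopen(f"{base_url}/api/tags", timeout=5) as response:
        payload = json.load(response)
    return model in installed_models(payload)
```

A False result suggests running ollama pull for the model; an OSError suggests the daemon is not running.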

Security Recommendations

Do not commit .env to source control
Rotate credentials immediately if exposed
Use a dedicated sender mailbox for automation

pip install -r requirements.txt

uvicorn app:app --reload

