Documentation

Automation

Learn how Dirstarter uses AI and third-party services for automating directory content generation

Dirstarter uses AI and third-party services for automating directory content generation.

How It Works

We use a set of tools to automate the content generation. This speeds up the process and ensures a consistent output.

Here's a step-by-step guide on how the automation works:

A tool with URL is added to the directory

Admin reviews the tool and schedules it for publishing

Jina AI scrapes the website and generates a structured JSON object

Google Gemini generates structured data and categorises the content

Screenshot of the website is taken using Screenshot API

Favicon is automatically extracted from the website

The generated content is saved to your database

Setup

In order to use the automation, you need to set up the following:

Jina AI

We use Jina AI to scrape the website and generate a structured JSON object. They provide a very generous free tier without an API key, but if you run out of credits, you can create a API key to get 1,000,000 free tokens.

.env
JINA_API_KEY=your_api_key

Google Gemini

Create a Google Gemini account and add the following environment variable:

We chose Gemini as a default model because it's the most cost-effective option. If you need more powerful model, you can use Claude or any other LLM supported by Vercel AI SDK.

.env
GEMINI_API_KEY=your_api_key

Customizing Generation

You can customize the generation by:

  1. Modifying the system prompt in the /api/ai/generate-content function
  2. Adjusting the schema to include additional fields
  3. Changing the temperature parameter (lower for more consistent results)
Edit on GitHub

Last updated on

On this page