Problem-Solving Guide

How to Think FlowMason

A comprehensive visual guide to transforming any problem into a working pipeline. Master the mental models, patterns, and decision processes used by pipeline architects.

The Core Mental Model

Every complex task is just a series of simple steps connected by data flow.

Your job: identify the steps, pick the right component for each, and wire them together.

Mental Models for Pipeline Thinking

Use these analogies to reason about your pipelines

The Assembly Line

Think of your pipeline as a factory assembly line. Raw materials (input) enter, pass through stations (stages), and finished products (output) emerge.

Input → Station 1 → Station 2 → Station 3 → Output

💡 Each station adds value. Stations can run in parallel if they work on independent parts.
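The assembly line can be sketched in plain Python as a list of functions applied in order. This is only an illustration of the mental model, not FlowMason's API; the station functions and `run_assembly_line` are hypothetical names.

```python
from functools import reduce
from typing import Any, Callable

Station = Callable[[Any], Any]


def run_assembly_line(raw_input: Any, stations: list[Station]) -> Any:
    """Pass the input through each station in order; each output feeds the next."""
    return reduce(lambda data, station: station(data), stations, raw_input)


# Hypothetical stations, for illustration only.
def strip_whitespace(text: str) -> str:
    return text.strip()


def lowercase(text: str) -> str:
    return text.lower()


def split_words(text: str) -> list[str]:
    return text.split()


result = run_assembly_line(
    "  Hello Pipeline World  ",
    [strip_whitespace, lowercase, split_words],
)
print(result)  # ['hello', 'pipeline', 'world']
```

Each station does one small job, so any of them can be swapped or tested on its own.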

The River Delta

Data flows like water. It can split into parallel streams, be filtered, and merge back together.

      →→→ [process A] →→→
→→                         →→→ [merge] →→→
      →→→ [process B] →→→

💡 Look for natural split and merge points. What can flow independently?
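A minimal sketch of the split-and-merge idea, using Python's standard thread pool rather than any FlowMason feature. The two branches (`process_a`, `process_b`) and the `merge` function are hypothetical stand-ins for independent stages.

```python
from concurrent.futures import ThreadPoolExecutor


def process_a(text: str) -> int:
    """Branch A (hypothetical): count the words."""
    return len(text.split())


def process_b(text: str) -> str:
    """Branch B (hypothetical): upper-case the text."""
    return text.upper()


def merge(word_count: int, shouted: str) -> dict:
    """Merge point: combine the results of both branches."""
    return {"word_count": word_count, "shouted": shouted}


def run_delta(text: str) -> dict:
    # Both branches read the same input and do not depend on each other,
    # so they can be submitted at the same time and merged afterwards.
    with ThreadPoolExecutor(max_workers=2) as pool:
        future_a = pool.submit(process_a, text)
        future_b = pool.submit(process_b, text)
        return merge(future_a.result(), future_b.result())


merged = run_delta("data flows like water")
print(merged)  # {'word_count': 4, 'shouted': 'DATA FLOWS LIKE WATER'}
```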

The Kitchen

Building a pipeline is like cooking a complex meal: some things can be prepped in parallel, some must happen in sequence, and some need a quality check before you proceed.

Prep vegetables (parallel) → Cook main (sequential) → Taste test (conditional) → Plate

💡 What can be prepped ahead? What's the critical path? Where are the quality gates?

The 6-Step Design Process

Follow this systematic approach for any pipeline you need to build

1. Define Your Goal
2. Identify Your Inputs
3. Decompose Into Steps
4. Choose Components
5. Map Dependencies
6. Build & Iterate

Define Your Goal

What outcome do you want?

Start with the end in mind. Clearly articulate what the pipeline should produce. This becomes your north star for every decision.

🧠 Ask Yourself

? What problem am I solving?

? What does success look like?

? What format should the output be in?

? Who or what will consume this output?

📋 Examples

Input: Raw customer feedback
Output: Sentiment analysis + categorization
Why: Need to route feedback to the right team

Input: Blog topic idea
Output: Publish-ready article
Why: Content automation

Input: API response data
Output: Validated, transformed records
Why: ETL process

⚠️ Common Mistakes to Avoid

✗ Vague goals like 'process the data'
✗ Multiple unrelated outputs from one pipeline
✗ Not considering the consumer of the output

💡 Pro Tip: Write it down: 'Given [specific input], I want [specific output] because [reason]'

Identify Your Inputs

What data are you starting with?

Document everything you're feeding into the pipeline. Be specific about structure, types, and what's required vs optional. This becomes your input_schema.

🧠 Ask Yourself

? What data do I have available?

? What format is it in? (JSON, text, array, etc.)

? Which fields are required vs optional?

? What are the constraints? (length, format, etc.)

📥 Common Input Types

📝 Text/String: Documents, prompts, messages

📊 Structured Data: JSON objects, form data, records

📚 Arrays/Lists: Collection of items to process

🌐 External Sources: API endpoints, database queries

⚠️ Common Mistakes to Avoid

✗ Accepting 'any' without validation
✗ Not documenting required fields
✗ Ignoring edge cases (empty, null, very long)

💡 Pro Tip: Create your input schema FIRST - it forces clarity about what you actually need
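As a rough illustration of writing the schema first, here is a JSON-Schema-style description of a feedback input, plus a tiny hand-rolled check. The field names (`feedback_text`, `customer_id`) and the length limits are assumptions made up for this sketch, and the validator covers only required fields and string lengths; it is not FlowMason's schema_validate operator.

```python
from typing import Any

# Hypothetical input schema for a customer-feedback pipeline.
INPUT_SCHEMA = {
    "type": "object",
    "required": ["feedback_text"],
    "properties": {
        "feedback_text": {"type": "string", "minLength": 1, "maxLength": 5000},
        "customer_id": {"type": "string"},  # optional
    },
}


def validate_input(payload: dict[str, Any], schema: dict[str, Any]) -> list[str]:
    """Return a list of problems; an empty list means the payload is acceptable."""
    errors = []
    for field in schema.get("required", []):
        if payload.get(field) is None:
            errors.append(f"missing required field: {field}")
    for field, rules in schema.get("properties", {}).items():
        value = payload.get(field)
        if value is None:
            continue  # absent optional field, or already reported above
        if rules.get("type") == "string":
            if not isinstance(value, str):
                errors.append(f"{field} must be a string")
                continue
            if len(value) < rules.get("minLength", 0):
                errors.append(f"{field} is too short")
            if len(value) > rules.get("maxLength", float("inf")):
                errors.append(f"{field} is too long")
    return errors


print(validate_input({"feedback_text": "Great product"}, INPUT_SCHEMA))  # []
print(validate_input({}, INPUT_SCHEMA))  # ['missing required field: feedback_text']
```

The edge cases called out above (empty, null, very long) each map to one explicit rule in the schema.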

Decompose Into Steps

What transformations are needed?

Break the journey from input to output into discrete, atomic operations. Each step should do ONE thing well. Think of each step as a station on an assembly line.

🧠 Ask Yourself

? If I had to explain this to a human, what would the steps be?

? Which steps require intelligence (AI) vs pure logic?

? Can any steps run in parallel (independent of each other)?

? What's the minimum viable path?

🔬 Example Decomposition

Problem: Generate a personalized product recommendation email

Bad Approach

One big AI prompt that does everything

Good Approach

→ 1. Fetch user purchase history
→ 2. Analyze preferences (AI)
→ 3. Get matching products from catalog
→ 4. Rank products for user (AI)
→ 5. Generate personalized email copy (AI)
→ 6. Format as HTML template

⚠️ Common Mistakes to Avoid

✗ Mega-prompts that try to do everything
✗ Steps with unclear boundaries
✗ Mixing AI and data tasks unnecessarily

💡 Pro Tip: If a step description has 'and' in it, it should probably be two steps
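The six-step decomposition of the recommendation email can be sketched as six small functions. Everything here is a stand-in: the data is invented, and the steps marked as AI are replaced by simple deterministic stubs so the sketch runs on its own.

```python
def fetch_purchase_history(user_id: str) -> list[str]:
    # Step 1. Stub standing in for a database or API call.
    return ["trail shoes", "running socks"]


def analyze_preferences(history: list[str]) -> str:
    # Step 2 (AI in a real pipeline). Stubbed with a keyword check.
    sporty = any("running" in item or "trail" in item for item in history)
    return "running" if sporty else "general"


def get_matching_products(preference: str) -> list[dict]:
    # Step 3. Hypothetical in-memory catalog.
    catalog = [
        {"name": "Hydration vest", "category": "running", "rating": 4.6},
        {"name": "Desk lamp", "category": "home", "rating": 4.8},
        {"name": "Running cap", "category": "running", "rating": 4.2},
    ]
    return [p for p in catalog if p["category"] == preference]


def rank_products(products: list[dict]) -> list[dict]:
    # Step 4 (AI in a real pipeline). Stubbed as a sort by rating.
    return sorted(products, key=lambda p: p["rating"], reverse=True)


def generate_email_copy(ranked: list[dict]) -> str:
    # Step 5 (AI in a real pipeline). Stubbed with a fixed sentence.
    names = ", ".join(p["name"] for p in ranked)
    return f"Recommended for you: {names}."


def format_as_html(copy: str) -> str:
    # Step 6. Pure formatting, no intelligence needed.
    return f"<html><body><p>{copy}</p></body></html>"


def recommendation_email(user_id: str) -> str:
    history = fetch_purchase_history(user_id)
    preference = analyze_preferences(history)
    products = get_matching_products(preference)
    ranked = rank_products(products)
    return format_as_html(generate_email_copy(ranked))


email_html = recommendation_email("user-123")
print(email_html)
```

Because each function has a single responsibility, any one of them can be replaced or tested without touching the others.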

Choose Components

Which FlowMason component fits each step?

Map each step from your decomposition to the right component type. This is where understanding the three categories is crucial.

🧩 Component Categories

🧠

AI Nodes

When you need understanding, creativity, or generation

generator: Generate text, analyze, summarize, extract

critic: Evaluate and score content

improver: Refine based on feedback

selector: Choose best from options

synthesizer: Combine multiple inputs

⚙️

Operators

When you need deterministic, repeatable logic

http_request: Call external APIs

json_transform: Reshape data with JMESPath

filter: Filter arrays by condition

schema_validate: Validate against JSON Schema

variable_set: Store values for reuse

logger: Debug output at any point

🔀

Control Flow

When you need branching, looping, or error handling

conditional: If/else branching

router: Switch/case multi-path

foreach: Loop over array items

trycatch: Error handling with fallback

subpipeline: Call another pipeline
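For readers who think in code, the control-flow components have familiar plain-Python analogues. The helpers below are hypothetical and only mirror the ideas of router, foreach, and trycatch; they are not FlowMason's implementation or configuration syntax.

```python
from typing import Any, Callable, Iterable

Stage = Callable[[Any], Any]


def router(key: str, routes: dict[str, Stage], default: Stage, payload: Any) -> Any:
    """Switch/case: pick one path by key, falling back to a default path."""
    return routes.get(key, default)(payload)


def foreach(items: Iterable[Any], stage: Stage) -> list[Any]:
    """Loop: apply the same stage to every item in a collection."""
    return [stage(item) for item in items]


def trycatch(stage: Stage, fallback: Callable[[Any, Exception], Any], payload: Any) -> Any:
    """Error handling: run the stage, and use the fallback if it raises."""
    try:
        return stage(payload)
    except Exception as error:
        return fallback(payload, error)


routed = router(
    "billing",
    {"billing": lambda ticket: f"billing:{ticket}"},
    default=lambda ticket: f"general:{ticket}",
    payload="ticket 7",
)
lengths = foreach(["a", "bb", "ccc"], len)
parsed = trycatch(int, lambda payload, error: 0, "not a number")

print(routed)   # billing:ticket 7
print(lengths)  # [1, 2, 3]
print(parsed)   # 0
```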

💡 Pro Tip: When in doubt: Does it need 'thinking'? → Node. Just data manipulation? → Operator.
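One way to make the mapping explicit is to write each step next to its chosen component and look up the category. The component names come from the lists above; the step plan for a customer-feedback pipeline is one plausible assignment invented for this sketch, not a prescribed design.

```python
# Component names and categories as listed in this guide.
COMPONENT_CATEGORIES = {
    "generator": "AI Node",
    "critic": "AI Node",
    "improver": "AI Node",
    "selector": "AI Node",
    "synthesizer": "AI Node",
    "http_request": "Operator",
    "json_transform": "Operator",
    "filter": "Operator",
    "schema_validate": "Operator",
    "variable_set": "Operator",
    "logger": "Operator",
    "conditional": "Control Flow",
    "router": "Control Flow",
    "foreach": "Control Flow",
    "trycatch": "Control Flow",
    "subpipeline": "Control Flow",
}

# Hypothetical step plan: (step description, chosen component).
STEP_PLAN = [
    ("Validate incoming feedback", "schema_validate"),
    ("Analyze sentiment", "generator"),
    ("Categorize feedback", "generator"),
    ("Route to the right team", "router"),
]


def categorize_plan(plan: list[tuple[str, str]]) -> list[tuple[str, str, str]]:
    """Attach a category to each step, rejecting unknown component names."""
    unknown = [component for _, component in plan if component not in COMPONENT_CATEGORIES]
    if unknown:
        raise ValueError(f"unknown components: {unknown}")
    return [(step, component, COMPONENT_CATEGORIES[component]) for step, component in plan]


for step, component, category in categorize_plan(STEP_PLAN):
    print(f"{step} -> {component} ({category})")
```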

Map Dependencies

What depends on what?

Draw the connections between stages. This creates your DAG (Directed Acyclic Graph). Stages without dependencies run in parallel automatically.

📐 Dependency Rules

📥 Add depends_on when you need data from another stage

⚡ No depends_on = runs immediately in parallel

⏳ Multiple dependencies = waits for ALL to complete

🔗 Reference upstream data with {{stages.id.output}}

💡 Pro Tip: Draw your DAG on paper first. It reveals parallelization opportunities and missing connections.
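The dependency rules can be sketched with Python's standard `graphlib` module. The stage ids and the dictionary layout below are hypothetical, and the `render` helper assumes that `id` in `{{stages.id.output}}` stands for the upstream stage's id; FlowMason's real scheduler and template engine may behave differently.

```python
import re
from graphlib import TopologicalSorter

# Hypothetical stages; only the depends_on idea is taken from the guide.
STAGES = {
    "fetch_history": {"depends_on": []},
    "fetch_catalog": {"depends_on": []},
    "analyze": {"depends_on": ["fetch_history"]},
    "rank": {"depends_on": ["analyze", "fetch_catalog"]},
}


def execution_waves(stages: dict[str, dict]) -> list[list[str]]:
    """Group stages into waves; every stage in a wave can run in parallel."""
    sorter = TopologicalSorter({sid: spec["depends_on"] for sid, spec in stages.items()})
    sorter.prepare()  # raises graphlib.CycleError if the graph is not acyclic
    waves = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # all dependencies already complete
        waves.append(ready)
        sorter.done(*ready)
    return waves


TEMPLATE = re.compile(r"\{\{\s*stages\.(\w+)\.output\s*\}\}")


def render(template: str, outputs: dict[str, object]) -> str:
    """Replace each {{stages.<id>.output}} reference with that stage's output."""
    return TEMPLATE.sub(lambda match: str(outputs[match.group(1)]), template)


waves = execution_waves(STAGES)
print(waves)  # [['fetch_catalog', 'fetch_history'], ['analyze'], ['rank']]
print(render("Rank for: {{stages.analyze.output}}", {"analyze": "running"}))  # Rank for: running
```

The two fetch stages have no dependencies, so they land in the first wave together, while `rank` waits for both of its upstream stages.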

Build & Iterate

Does it work?

Start minimal, test incrementally, and expand. Don't try to build the complete pipeline at once.

🏗️ Build in Phases

1. Skeleton: 1-2 stages, happy path only. Does basic flow work?

2. Core Logic: Add remaining stages. Does each component work?

3. Error Handling: Add trycatch, validation. What can go wrong?

4. Optimization: Add parallelism, caching. Can it be faster?

💡 Pro Tip: Use FlowMason Studio's debugger to step through execution and inspect data at each stage
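The skeleton phase can be imitated in plain Python by running two stages and recording what each one produced. This is only an analogy for inspecting data stage by stage; `run_with_trace` and the stage names are hypothetical and unrelated to FlowMason Studio's debugger.

```python
from typing import Any, Callable

Stage = tuple[str, Callable[[Any], Any]]


def run_with_trace(payload: Any, stages: list[Stage]) -> tuple[Any, list[tuple[str, Any]]]:
    """Run stages in order and keep each stage's output for inspection."""
    trace = []
    data = payload
    for name, stage in stages:
        data = stage(data)
        trace.append((name, data))
    return data, trace


# Phase 1 skeleton: two stages, happy path only.
skeleton = [("clean", str.strip), ("shout", str.upper)]
output, trace = run_with_trace("  hello  ", skeleton)

for name, value in trace:
    print(f"{name}: {value!r}")
# clean: 'hello'
# shout: 'HELLO'
```

Once the skeleton behaves as expected, further stages can be appended to the list one at a time and checked the same way.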

Component Decision Flowchart

Not sure which component to use? Follow this visual decision tree:

1. Does this step require understanding, creativity, or language?

Yes → Use an AI Node (generator, critic, improver)