Sequence of tool development and LLM techniques learned or used#

1. Doc (First experiment)#

Basic prompt engineering techniques:

  • Capitalization in prompts to reinforce critical instructions
  • Two-step chunking strategy (chunk → meta summary)
  • Structural element extraction with tags (headings, tables, captions)
  • pymupdf4llm for LLM-friendly Markdown
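
The two-step chunking strategy can be sketched roughly as follows; `chunk_text` and the `llm` callable are placeholders for illustration, not Doc's actual implementation:

```python
def chunk_text(text: str, max_chars: int = 2000) -> list[str]:
    """Split text into chunks, preferring paragraph boundaries."""
    paragraphs = text.split("\n\n")
    chunks, current = [], ""
    for para in paragraphs:
        if current and len(current) + len(para) + 2 > max_chars:
            chunks.append(current)
            current = para
        else:
            current = f"{current}\n\n{para}" if current else para
    if current:
        chunks.append(current)
    return chunks

def summarize(text: str, llm) -> str:
    # Step 1: summarize each chunk independently.
    chunk_summaries = [llm(f"Summarize:\n{c}") for c in chunk_text(text)]
    # Step 2: merge the chunk summaries into one meta summary.
    return llm("Combine into one summary:\n" + "\n".join(chunk_summaries))
```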

2. Code Analyzer#

Specialization and hierarchical analysis:

  • Multi-perspective analysis: 4 separate specialized LLM analyses per file (business logic, technical aspects, interfaces, issue detection)
  • Hierarchical analysis approach across multiple abstraction levels (file→package→module→system)
  • Dialog-based requirement clarification: LLM actively asks questions
  • Asynchronous processing with semaphore-based rate limiting for many LLM calls
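
Semaphore-based rate limiting for many concurrent LLM calls looks roughly like this; the `analyze` coroutine stands in for the real per-file analysis:

```python
import asyncio

async def analyze_all(files, analyze, max_concurrent=4):
    """Run many LLM analysis calls with a cap on concurrency."""
    sem = asyncio.Semaphore(max_concurrent)

    async def limited(path):
        async with sem:  # at most max_concurrent calls in flight
            return await analyze(path)

    # gather preserves input order regardless of completion order
    return await asyncio.gather(*(limited(f) for f in files))
```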

3. AI survey#

Initial agent workflows and structuring:

  • Single agent with workflow orchestration: Evaluation → Inquiry → Structuring
  • Structured JSON outputs as a basis for communication
  • Clarity score (0-1) for automatic evaluation
  • Pydantic for data validation in LLM outputs
  • Safeguards against infinite loops in JSON-driven systems
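
A minimal stdlib sketch of the Evaluation → Inquiry loop with a clarity threshold and a loop safeguard; the real tool uses Pydantic for validation, and `llm`/`ask_user` are hypothetical callables:

```python
import json

MAX_ROUNDS = 5  # safeguard against infinite clarification loops

def clarify(llm, ask_user, answer: str, threshold: float = 0.8) -> str:
    """Evaluation -> Inquiry loop: re-ask the user until the answer is clear."""
    for _ in range(MAX_ROUNDS):
        raw = llm('Rate this answer as JSON {"clarity": <0-1>, "question": "..."}: '
                  + answer)
        result = json.loads(raw)
        clarity = float(result["clarity"])
        if not 0.0 <= clarity <= 1.0:          # schema validation (Pydantic's role)
            raise ValueError("clarity score out of range")
        if clarity >= threshold:               # clear enough: stop inquiring
            return answer
        answer = ask_user(result["question"])  # inquiry step
    return answer                              # give up rather than loop forever
```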

4. stt-helper#

Cascading workflows:

  • Three-stage processing cascade: Cleaning → Revision → Formatting
  • Focused single-purpose prompts (one task per stage)
  • Development interface for prompt optimization:
    • Interactive prompt adjustment during processing
    • Insight into the intermediate results of each stage
  • Character-based chunking for longer texts
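
The cascade pattern, with all intermediate results kept for inspection, can be sketched as follows; the stage prompts are illustrative, not the tool's actual prompts:

```python
STAGES = [
    ("cleaning",   "Remove filler words and transcription errors:\n{}"),
    ("revision",   "Fix grammar and sentence structure:\n{}"),
    ("formatting", "Add paragraphs and punctuation:\n{}"),
]

def cascade(llm, text: str) -> dict:
    """Run the three single-purpose prompts in sequence; keep every
    intermediate result so each stage can be inspected and tuned."""
    results = {"input": text}
    for name, prompt in STAGES:
        text = llm(prompt.format(text))
        results[name] = text
    return results
```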

5. ppt-helper#

Multi-agent architectures:

  • Two-agent architecture with clear roles (chat agent + artifact agent)
  • Structured JSON communication protocol between agents
  • Bidirectional “return channel”: Artifact agent can ask questions to chat agent (not just unidirectional)
  • Model selection based on prompt-following capabilities rather than just benchmarks
  • <1000 lines per file for LLM maintainability
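
The bidirectional JSON protocol can be sketched as follows; the message schema and the `handle` helper are hypothetical, not ppt-helper's real interface:

```python
import json

def artifact_agent(llm, task: str) -> dict:
    """The artifact agent replies with a typed JSON message: either a
    finished artifact or a question routed back to the chat agent."""
    raw = llm(f'Reply as JSON {{"type": "artifact" or "question", "content": "..."}}: {task}')
    msg = json.loads(raw)
    if msg["type"] not in ("artifact", "question"):
        raise ValueError("unknown message type")
    return msg

def handle(chat_agent_llm, artifact_llm, task: str) -> str:
    msg = artifact_agent(artifact_llm, task)
    if msg["type"] == "question":           # return channel: ask the chat agent
        answer = chat_agent_llm(msg["content"])
        msg = artifact_agent(artifact_llm, f"{task}\nClarification: {answer}")
    return msg["content"]
```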

6. Translate#

Scaling and context management:

  • Multi-stage process: Markdown conversion → chunking → parallel translation → composition
  • Context management system: glossaries, already-translated key terms, and chunk summaries are passed along to the LLM
  • Asynchronous parallel API calls for performance
  • Uniform Markdown pipeline instead of format-specific processing
  • Consistency across chunk boundaries through context transfer
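
A simplified, sequential sketch of the context-transfer idea; the real pipeline additionally parallelizes the translation calls, and `llm` is a placeholder:

```python
def translate_chunks(llm, chunks: list[str], glossary: dict) -> str:
    """Translate chunk by chunk, passing the glossary and a short summary
    of what came before so terminology stays consistent across boundaries."""
    context_summary = ""
    out = []
    for chunk in chunks:
        prompt = (f"Glossary: {glossary}\n"
                  f"Previous context: {context_summary}\n"
                  f"Translate:\n{chunk}")
        out.append(llm(prompt))
        # Summarize this chunk to carry context into the next one.
        context_summary = llm(f"Summarize in one sentence:\n{chunk}")
    return "\n".join(out)
```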

7. TalkToDocuments#

Extensive context utilization:

  • Utilization of large contexts without traditional chunking or vector databases
  • Tiktoken integration for token counting and visualization
  • Simple reference system [P1], [P2] instead of complex metadata
  • Intelligent content cleaning and deduplication before LLM processing
  • Direct retention of multiple documents in context (up to 20 documents)
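
The [P1]/[P2] reference system amounts to labeling each document before concatenating everything into one large context; a minimal sketch (`build_context` is an assumed name):

```python
def build_context(documents: dict, max_docs: int = 20) -> tuple:
    """Label each document [P1], [P2], ... and concatenate them into one
    prompt context; the labels let the LLM cite sources without metadata."""
    labels = {}
    parts = []
    for i, (name, text) in enumerate(list(documents.items())[:max_docs], start=1):
        label = f"[P{i}]"
        labels[label] = name
        parts.append(f"{label} {name}\n{text}")
    return "\n\n".join(parts), labels
```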

8. TextTool#

State management and systematic documentation:

  • Implementation_Status document: LLM maintains implementation status itself after each step
  • State-less development: Each new interaction possible with complete context
  • Artifact-centered approach (input/output areas instead of dialog prompts)
  • Curated tool library with 12 optimized prompts without meta comments
  • Combination of prompt engineering + heuristic filtering: Prompt following reinforced by downstream filtering
  • Differentiated prompt engineering strategy: structured instructions are followed well, while instructions embedded in continuous prose are followed only weakly
  • Automatic title generation through separate LLM calls for history function
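
The prompt-plus-filter combination can be sketched as a small post-processing step; the patterns below are illustrative examples, not TextTool's actual filter list:

```python
import re

# Illustrative patterns for meta comments that models sometimes prepend
# even when the prompt forbids them.
META_PATTERNS = [
    r"^Here (is|are) .*?:\s*",       # "Here is the revised text:"
    r"^(Sure|Certainly)[,!.].*?\n",  # chatty acknowledgements
]

def strip_meta(output: str) -> str:
    """Downstream heuristic filter that reinforces prompt following."""
    for pattern in META_PATTERNS:
        output = re.sub(pattern, "", output, flags=re.IGNORECASE)
    return output.strip()
```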

9. Web Helper#

Rapid prototyping and two-stage analysis:

  • Two-stage LLM processing: Content structure analysis → Suggestions for improvement → Application to content
  • JSON handling of large structured data (12 MB) with 256K token context
  • In-memory session processing (no persistent state)

10. Chart Tool#

Complex multi-agent architecture with pattern libraries:

  • Intent→Plan→Execute workflow (three-stage evolution)
  • IntentService for intent classification (modification, single chart, multiple charts, analysis)
  • PlanService translates intents into detailed execution plans
  • ExecutionService coordinates execution with retry logic
  • Pattern libraries as a hybrid approach: 44 code patterns (23 implementation, 7 anti-patterns, 9 modification, 5 semantic)
    • LLM selects and adapts patterns based on data types
    • Hybrid: LLM intelligence + templates
  • fix_code() method for LLM-based self-correction with error feedback
  • Retry logic: on errors, the error message is sent back to the LLM for code correction
  • Controlled code execution with restricted built-ins and predefined safe globals
  • Semantic analysis: SemanticColorHelper for natural-language color specifications
  • Multi-service orchestration: multiple specialized services working together
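
The self-correction loop with restricted execution can be sketched as follows; `run_with_self_correction` is a hypothetical stand-in for the fix_code()/retry mechanism, and the safe-globals whitelist is illustrative:

```python
def run_with_self_correction(llm, code: str, max_retries: int = 2):
    """Execute generated chart code in a restricted namespace; on failure,
    send the error message back to the LLM for a corrected version."""
    safe_globals = {"__builtins__": {"range": range, "len": len, "min": min,
                                     "max": max, "sum": sum}}
    for _ in range(max_retries + 1):
        try:
            namespace = dict(safe_globals)
            exec(code, namespace)          # controlled execution
            return namespace.get("result")
        except Exception as err:
            # Error feedback: the failure message becomes part of the
            # correction prompt.
            code = llm(f"This code failed with: {err!r}\n"
                       f"Return a corrected version:\n{code}")
    raise RuntimeError("could not repair generated code")
```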

11. Personnel cost calculator#

Integrated hybrid architecture with state-based dialog guidance:

  • Strict hybrid architecture: LLM exclusively for parameter extraction, deterministic calculations completely LLM-free (Python Decimal for exact arithmetic)
  • LLM as an intelligent interface: Bridge between natural language and structured processing
  • State machine for dialog guidance: 7 defined states (INITIAL→PARSING→CLARIFYING→CALCULATING→COMPLETE, plus FALLBACK/ERROR)
  • Three-stage fallback mechanism: After 3 parse errors, automatic manual form with pre-filled values
  • JSON interface with Pydantic: Clear contract definition between LLM and backend
  • Structured requirements gathering: a dedicated prompt guides the requesting department through systematic questions before development
  • Specification-first approach: 50 min. of specification enabled 40 min. of implementation (5,700 lines in one round)
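
The strict split (LLM for parameter extraction only, exact Decimal arithmetic kept LLM-free) can be sketched like this; the parameter schema is invented for illustration:

```python
import json
from decimal import Decimal

def extract_params(llm, user_text: str) -> dict:
    """LLM is used only as the language interface: it turns free text into
    structured parameters and never performs arithmetic itself."""
    raw = llm(f'Extract as JSON {{"salary": "...", "overhead_pct": "..."}}: {user_text}')
    return json.loads(raw)

def annual_cost(salary: str, overhead_pct: str) -> Decimal:
    """Deterministic, LLM-free calculation with exact Decimal arithmetic."""
    return Decimal(salary) * (1 + Decimal(overhead_pct) / 100)
```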

12. Text Style Editor#

Control-based text transformation with a two-step process:

  • Two-step process: Neutralization (10 dimensions) → Stylization (34 controls in 7 categories)
  • Three slider types: Polar sliders (-10 to +10), intensity sliders (0-10) and step sliders (discrete options)
  • Intensity levels: Precise control from ‘light’ (1-2) to ‘extreme’ (9-10)
  • 23 presets: Predefined slider combinations for typical use cases
  • Hash code export/import: Persistence of settings across sessions
  • Flat architecture: 6 Python modules (app.py, config.py, llm_client.py, models.py, prompt_builder.py, token_counter.py)
  • Prompt builder: Context-specific LLM prompt generation based on controller settings
  • Technical stack: Python, Gradio 6, OpenAI-compatible API (vLLM), tiktoken
  • Methodological approach: Detailed specification (1,400 lines) prior to implementation to reduce iteration loops
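
A prompt builder that maps slider values to wording might look like this; the intermediate level names are assumptions (the source names only "light" and "extreme"):

```python
# Maps an intensity slider (0-10) to the wording used in the prompt.
# Only "light" (1-2) and "extreme" (9-10) come from the source; the
# intermediate labels are illustrative.
LEVELS = [(2, "light"), (4, "moderate"), (6, "clear"), (8, "strong"), (10, "extreme")]

def describe(control: str, value: int) -> str:
    for upper, word in LEVELS:
        if value <= upper:
            return f"{word} {control}"
    raise ValueError("slider value out of range")

def build_prompt(settings: dict) -> str:
    """Turn non-zero slider settings into a style instruction line."""
    active = [describe(name, v) for name, v in settings.items() if v > 0]
    return "Rewrite the text with: " + ", ".join(active)
```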

Development line of LLM techniques#

Phase 1 - Basics (Doc, Code Analyzer):

  • Prompt engineering basics (capitalization, double reinforcement)
  • Chunking strategies
  • Multi-perspective analysis
  • Dialogue-based interaction

Phase 2 - Workflows (AI survey, stt-helper):

  • Structured JSON outputs with Pydantic
  • Cascading focused prompts
  • Workflow orchestration by LLM
  • Development interfaces for prompt iteration

Phase 3 - Multi-agents (ppt-helper):

  • Two-agent architectures
  • Bidirectional structured communication
  • Prompt following as a selection criterion

Phase 4 - Scaling (Translate, TalkToDocuments):

  • Context management systems
  • Large context usage (256K tokens)
  • Parallel API calls
  • Token tracking

Phase 5 - Systematization (TextTool, Web-Helper):

  • LLM maintains its own metadata (Implementation_Status)
  • Artifact-centered approaches
  • Two-stage analysis workflows

Phase 6 - Complex systems (Chart-Tool):

  • Intent→Plan→Execute Pattern
  • Pattern libraries as a hybrid approach
  • Self-correction mechanisms
  • Multi-service orchestration
  • Semantic analysis components

Phase 7 - Integrated architectures (personnel cost calculator):

  • Strict separation of LLM/deterministic logic as an architectural principle
  • State machines for robust dialog control
  • Fallback mechanisms for LLM uncertainty
  • Specification-first as a development methodology
  • Synthesis of earlier techniques (JSON/Pydantic, workflows, dialog control) into a robust overall system