Personal essay

My TAFFISH Story

TAFFISH did not begin as a platform. It began with biological questions, repeated computational work, and a simple but persistent wish to preserve experience instead of losing it in scripts and memory.

1. From genome and disease questions to reusable work

TAFFISH was not first imagined as a software engineering project, and it was not designed as a workflow platform from the beginning. Its origin was a biological question.

My training moved across several borders: physics as an undergraduate, bioinformatics during my master's study, and later electronic information and system design during doctoral work. That path kept me at an intersection. On one side were biological questions. On the other were data, tools, workflows, and computing environments.

The earliest question was not about tools. It was about the relationship between genome information and disease. If the genome state of a cell or sample is complex and multi-layered, could multiple omics layers help recover a more realistic picture of it?

At that stage, I roughly understood genome information through several layers: one-dimensional sequence information such as WGS, SNP, and CNV; three-dimensional spatial information such as Hi-C or 3C-derived technologies; and epigenetic information such as DNA methylation and histone modifications. These layers are not isolated. Sequence changes may affect three-dimensional structure, three-dimensional structure may affect gene expression, and epigenetic marks participate in regulatory processes.

That was a natural biological research route. But once the work began, another problem quickly became visible: too much time was spent on data cleaning, tool installation, parameter adjustment, format conversion, and repeating processes that should have been preserved after being solved once.

That was the first root of TAFFISH.

2. From biological questions to tool problems

In bioinformatics, researchers often care most about biological conclusions, but much of the actual effort goes into the computational machinery around those conclusions. A project may combine Hi-C, RNA-seq, variant data, and annotations. Each data type has its own format. Each step has its own tool. Each tool has its own dependencies, versions, and parameters.

A workflow may run on one server and fail on another. One person may know how to run it, while the next person has to rediscover the paths, environments, and scripts. Gradually, I realized that this work has inheritance value. If a research process accumulates experience, that experience should not remain only in personal scripts or memory. It should be saved, reused, and passed on.

So the first idea was simple: build a multi-omics analysis tool, integrate existing tools, collect common processing steps, and make them easier for myself and nearby users to run. At the same time, I was learning Common Lisp and writing small data-processing programs, for example, searching for information near selected genes or loci across multi-omics results.

This was not TAFFISH yet, but its shadow was already there: command-line work, data processing, tool integration, packaged execution, and a lower barrier for reuse.

3. Looking at Nextflow and Snakemake

I had also looked at existing workflow systems, especially Nextflow and Snakemake. Looking back, I see them as mature, powerful, and important tools. They solve many large workflow management problems in bioinformatics.

At that time, I was still learning Linux, Shell, bioinformatics tools, server environments, and programming while starting from biological questions. I could feel that Nextflow and Snakemake were powerful, but it was still difficult for me to make those complete systems part of my daily analysis practice.

So I chose the simplest route available to me at that stage: write Shell, call local tools, and use Common Lisp for helper programs. Those systems were not wrong, and Shell was not perfect; Shell was simply the tool I could use immediately. It was direct, close to the command-line tools themselves, and it ran.

Later, this Shell-plus-Lisp style revealed its own limits. Scripts record processes but do not manage environments well. Small tools simplify operations but do not unify installation. Running locally does not mean running elsewhere. Being understandable to one person does not mean being maintainable by a team. The problem returned in a new form.

4. From single tools to modular thinking

The original multi-omics tool idea was too broad to finish in one step. I began to break it down: start with a single omics field, namely the Hi-C and 3D genome analysis I knew best, and then expand toward other omics layers.

This was important because it showed that a sustainable system cannot be completed by one large design. It has to be modular. Existing tools should be embedded. Intermediate formats should be converted. Inputs and outputs should be adjusted. The system should not bind itself to one fixed tool, but should allow replacement, composition, and extension.

This direction has stayed with TAFFISH ever since. The goal is not to replace every existing tool, but to make existing command-line tools easier to package, reuse, distribute, and inherit.

5. BioFlow and BioHub: early forms of workflows and a tool library

As tool integration continued, BioFlow and BioHub appeared. BioFlow focused on workflows. BioHub started to mean a tool library and platform. The problems became clearer: installation was hard, usage was hard, composition was hard, management was weak, inheritance was fragile, and standards were missing.

At that point, the question was no longer only how to write a workflow. It was how to manage many tools and many workflows across environments and users. This is why TAFFISH-HUB later became necessary.

6. TAFFISH as a lightweight enhancement over Shell

BioFlow and BioHub gradually evolved into TAFFISH. One important realization was that “bio” was the current application scenario, not the core of the project.

TAFFISH means Tools And Flows Framework Intensify SHell. The most important word in that name is Shell. TAFFISH should not leave Shell behind completely. In bioinformatics, many tools are already command-line tools. If reproducibility requires users to learn a heavy new language, the system creates another burden.

The direction became: keep the shape of Shell, and add semantics only where needed.

ARGS
  <(--/-i)input>
RUN
<container:ghcr.io/taffish/example:1.0.0>
  tool --input ::input::

The command still looks close to Shell. ::input:: performs parameter substitution. <container:...> declares the runtime environment. TAFFISH can then handle parameters, containers, paths, and compilation behind the scenes.
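As a toy illustration only, not TAFFISH's actual compiler output, the effect of `::input::` substitution can be mimicked with plain shell string replacement; the variable names here are invented for the sketch:

```shell
# Toy sketch of ::name:: substitution; not TAFFISH's real output.
# 'template' and 'input' are invented names for this illustration.
template='tool --input ::input::'
input='reads.fastq'
# Bash pattern replacement expands the placeholder into the bound value.
expanded=${template//::input::/$input}
echo "$expanded"
```

In the real system, TAFFISH also resolves the <container:...> tag into a containerized invocation around the expanded command; this sketch covers only the parameter step.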

TAFFISH is not meant to replace Shell. It is meant to strengthen Shell where Shell is weak: environment boundaries, parameter structure, app packaging, and distribution.

7. Old TAFFISH: usable prototype and engineering boundaries

The old TAFFISH soon became runnable. It could compile taf scripts into shell, run tools through containers, and expose wrapped tools as taf-xxx commands so users could call them like ordinary commands.

That proved the original idea was feasible. The old version also exposed the engineering boundaries of the prototype stage. Early tag design was relatively complex, and the feature set tried to cover too many cases at once. Later, many of those functions were deliberately folded back into a smaller core, so the language semantics and runtime model could become easier to maintain and easier to understand.

GUI was an exploration that arrived a little too early. I thought about GUI very early, because many bioinformatics users are not familiar with the command line. If a graphical interface could select tools, fill parameters, and organize workflows, TAFFISH would be easier to use. But later I realized that before the CLI semantics and project structure were stable, a GUI would amplify the uncertainty underneath it.

So I temporarily put GUI aside and returned to CLI and core language design. Later, this proved necessary. Only when the CLI and core become stable can GUI be built on the right foundation.

8. Common Lisp, Rust, LispWorks, and engineering delivery

TAFFISH has mostly been written in Common Lisp. That is not the most common choice today, but it fits the problem. TAFFISH is not a regular business application. It is a small language system: it needs lexing, parsing, parameter binding, semantic expansion, and shell code generation.

Common Lisp's interactive development model, symbolic processing, and abstraction ability made it possible to test syntax ideas quickly and keep refactoring as the system changed. For TAFFISH, Common Lisp is not only an implementation language. It also shapes the way the system grows.

I also seriously considered rewriting TAFFISH in Rust. Rust has excellent engineering qualities, convenient binary distribution, and a type system suitable for long-term maintenance. That attempt was not wasted. Even though Rust did not become the implementation language, it influenced the newer TAFFISH structure.

taf new
taf check
taf build
taf run
taf publish

These commands carry a project-management mindset similar to Cargo. taffish.toml also acts as a project description file. The newer separation between core and cli layers, the project structure, the error boundaries, and the command behavior all benefited from that engineering perspective.

Eventually, I returned to Common Lisp. That was not a return to the starting point, but a confirmed choice after comparison. The core problem of TAFFISH is language design, semantic expansion, script generation, and interactive evolution. Common Lisp remains a strong fit for that.

LispWorks later became important for another reason: delivery. Early TAFFISH could rely on SBCL-generated executables for prototypes and personal use, but a tool intended for broader use has to be packaged, installed, updated, and run reliably in other users' environments. LispWorks gave TAFFISH a way to keep Common Lisp while solving Linux portability more seriously. macOS is not yet covered by that route, but support can be adapted later.

LispWorks also made me think more seriously about future delivery, deployment, and possible graphical interfaces while still staying with Common Lisp. TAFFISH is currently centered on command-line tools, but if GUI work returns later, LispWorks may become an important candidate route.

From this perspective, TAFFISH has also been a process of re-understanding Common Lisp. At first, I used Lisp because it was expressive and good for rapid development. Later, I kept using Lisp because it fit a DSL and compiler-like system. Then, when I began considering LispWorks, I realized that Common Lisp could also enter a more formal software delivery stage.

TAFFISH eventually kept Lisp's flexibility while absorbing the engineering structure that Rust had taught me. It did not fully follow one language paradigm. It found its own shape through repeated trials. For me, TAFFISH is also a proof that Common Lisp can still be used to build new, real, usable scientific software systems.

9. New TAFFISH: from prototype to engineering code

The real turning point was the new refactor. This time, the goal was not only to prove that features could run. It was to reorganize the system boundaries and engineering structure:

lexer     read taf source
parser    parse ARGS, RUN, and tags
binder    bind parameters and context
emitter   compile each tag into shell fragments
compiler  generate final shell
cli       receive terminal input and call the core

After that, TAFFISH began to look more like a real compiler than a temporary scripting tool. <shell>, <container>, <taf-app>, and <taffish> are no longer scattered special cases. They are handled through emitters, making future tags and backends easier to add.

The newer design also supports scoped parameters such as @xxx:, which makes staged parameters, defaults, and context binding more natural in complex flows. It also prepares a foundation for a future CLI-GUI integrated route.

After this refactor, TAFFISH had a clearer engineering foundation.

10. Pipe support: bringing container tools back to Shell

One of the most important design confirmations in the new version is pipe support. Containerized tools can easily become sealed boxes: they solve environment problems but lose their natural connection to Shell. TAFFISH should do the opposite. It should bring containerized tools back into ordinary Shell composition.

echo 123 | taf-test cat

When this worked, it confirmed an important design judgment: TAFFISH had not moved away from Shell. It had strengthened Shell. A taf app can be a stable, reproducible, distributable, and composable command-line part. Stable parts make stable workflows possible.
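The behavior can be sketched with a hypothetical wrapper function. The real taf-xxx commands wrap a container runtime; here the tool is run directly so the sketch stays self-contained:

```shell
# Hypothetical sketch of a pipe-friendly wrapper; not TAFFISH's code.
# A real wrapper would start the container with stdin attached
# (e.g. a container runtime's -i flag); here we exec the tool directly,
# so stdin and stdout still flow straight through the wrapper.
taf_test() {
  "$@"
}

echo 123 | taf_test cat
```

The essential point is only that the wrapper forwards stdin and stdout untouched; once that holds, a wrapped tool composes in pipes like any ordinary command.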

TAFFISH is also not trying to replace Nextflow, Snakemake, or Galaxy. It focuses on a different entry point: how each command-line tool can be packaged, installed, reproduced, and composed. When tools can be passed on, workflows can be passed on. When environments can be reproduced, results have a better chance of being reproduced too.

11. How my thinking shaped TAFFISH

TAFFISH has also been shaped by the way I think about systems. I have long been interested in Daoist thought, Zhuangzi, and the I Ching. They do not directly guide code implementation, but they do affect how I think about boundaries, position, growth, and change.

A good system is not necessarily the system with the most features. It is a system whose parts know what they are responsible for, whose boundaries are clear, and whose structure can keep growing.

Shell keeps composition.
Containers bound runtime environments.
taf-app packages tools.
taf-flow organizes workflows.
TAFFISH-HUB distributes tools and flows.
taf-cli manages creation, building, installation, and publishing.

This is why some early complicated tags were removed, why GUI work was paused, and why an over-large design was avoided. TAFFISH does not need to swallow everything. It needs tools, containers, scripts, the Hub, and the CLI to each stay in the right place.

If TAFFISH has changed constantly over the years, what has not changed is the problem it wants to solve: making command-line tools and workflows in research easier to reproduce, migrate, distribute, and inherit. What has changed is the way I have tried to answer that problem.

12. From a tool to an ecosystem

TAFFISH is now more than a .taf compiler. It is becoming a layered ecosystem:

taffish-core      compiler core
taffish-cli       terminal entry point
taf-core / taf-cli project management, install, build, publish
taffish-hub       tool and flow ecosystem

In the future, users should be able to create projects with taf new, check project structure with taf check, build local commands with taf build, publish tools or flows with taf publish, and install apps from the Hub with taf install.

The road has included several turns. The project changed names, rejected designs, postponed attractive features, and went through code cleanup and heavy refactoring. At times, the pace of building such a system mostly alone felt slow, but those stages also made the system's boundaries clearer.

But every time a real tool runs, every time a cross-environment execution succeeds, and every time a messy process becomes a callable taf-xxx command, the answer becomes clearer. TAFFISH is not only a doctoral project to me. It is a road from biological questions to reusable scientific software.

TAFFISH began from a practical need: while doing biological research and multi-omics analysis, I did not want to repeatedly spend time on work that could be preserved and inherited. Later it became a larger question: can command-line tools in research be better packaged, reproduced, distributed, and passed on?

TAFFISH is still evolving. But it is no longer only an idea. It is now a system that can run, be tested, and be extended. It grew from biological thinking about genome and disease, passed through multi-omics tool ideas, BioFlow, and BioHub, and arrived at today's TAFFISH.

It also records my own changing understanding of research, engineering, languages, tools, and long-term work.

That road is still long. But now it is no longer only an idea. It has reached the ground underfoot.