(doris) branch master updated: [improvement](ci) Refine Codex review goal completion prompt (#64658)

zclll Sun, 21 Jun 2026 18:46:02 -0700

This is an automated email from the ASF dual-hosted git repository.

zclllyybb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/doris.git



The following commit(s) were added to refs/heads/master by this push:
     new 88991dc3386 [improvement](ci) Refine Codex review goal completion 
prompt (#64658)
88991dc3386 is described below

commit 88991dc3386452818f5dcde44025c839882f0bde
Author: zclllyybb <[email protected]>
AuthorDate: Mon Jun 22 09:45:40 2026 +0800

    [improvement](ci) Refine Codex review goal completion prompt (#64658)
    
    ### What problem does this PR solve?
    
    Problem Summary: I modified the codex to support the `codex exec --goal`
    mode. The Codex review workflow now enters goal mode through the
    runner's --goal invocation, so the review instructions should describe
    the completion criteria and convergence process instead of telling the
    model to create the goal itself.
    
    see modification in
    https://github.com/zclllyybb/codex/tree/codex-exec-goal
    
    Verification:
    - Before:
    https://github.com/apache/doris/pull/63690 got 1 comment when `/review`
    
    - Now:
    https://github.com/zclllyybb/doris/pull/49 replicated the same PR,
    receiving 8 comments in a single `/review`
---
 .claude/skills/code-review/SKILL.md      |  10 ++-
 .github/workflows/code-review-runner.yml | 135 +++++++++++++++++++++++++++++--
 2 files changed, 137 insertions(+), 8 deletions(-)

diff --git a/.claude/skills/code-review/SKILL.md 
b/.claude/skills/code-review/SKILL.md
index 539e36224d9..dea95615b8a 100644
--- a/.claude/skills/code-review/SKILL.md
+++ b/.claude/skills/code-review/SKILL.md
@@ -14,9 +14,13 @@ Use this when you need to review code, whether it is code 
you just completed or
 
 ## How to use me
 
-1. **Always read and respond to Part 1** (General Principles) — it applies to 
all code.
-2. For module-specific review, **read the `AGENTS.md` in the corresponding 
source directory** listed in Part 2. Those files contain non-obvious 
conventions and traps specific to each subsystem.
-3. Parts 3–7 cover cross-module concerns, testing, high-risk patterns, 
functions, and standards — refer as needed.
+0. **MANDATORY GOAL COMPLETION REQUIREMENT:** When the review is running in 
Codex goal mode, the goal is complete only after every changed file and 
relevant surrounding code path has been examined, every suspicious point has 
been accepted as an inline issue or dismissed with evidence, and every accepted 
issue has been submitted and verified on GitHub.
+1. **MANDATORY GOAL PROCESS REQUIREMENT:** The goal's progress tracking must 
cover instruction loading, subagent spawning, shared-ledger maintenance, 
candidate verification/deduplication, final subagent convergence, GitHub review 
submission, and GitHub API verification. The goal is not complete until every 
live subagent has said `NO_NEW_VALUABLE_FINDINGS` for the same current 
ledger/comment set after the last candidate update.
+2. **MANDATORY SUBAGENT REVIEW REQUIREMENT:** Use the available subagent or 
multi-agent spawn tool for focused review passes; do not merely simulate 
subagent output. The main agent must read the subagent results, independently 
verify or dismiss every candidate with concrete code evidence, deduplicate 
against existing review threads, submit the final GitHub review itself, and 
summarize the subagent conclusions.
+3. **MANDATORY SHARED LEDGER REQUIREMENT:** When a shared subagent review 
ledger is provided, every subagent must read the whole ledger and append 
findings only to its assigned subagent section. The main agent must use the 
ledger as the source of truth for merging, status updates, duplicate 
suppression, proposed final comments, and the final convergence round. 
Subagents must not edit another subagent section or any main-owned section; 
this section-owned append-only rule avoids concurrent [...]
+4. **Always read and respond to Part 1** (General Principles) — it applies to 
all code.
+5. For module-specific review, **read the `AGENTS.md` in the corresponding 
source directory** listed in Part 2. Those files contain non-obvious 
conventions and traps specific to each subsystem.
+6. Parts 3–7 cover cross-module concerns, testing, high-risk patterns, 
functions, and standards — refer as needed.
 
 ---
 
diff --git a/.github/workflows/code-review-runner.yml 
b/.github/workflows/code-review-runner.yml
index f7f299bbf7f..c6bcced9774 100644
--- a/.github/workflows/code-review-runner.yml
+++ b/.github/workflows/code-review-runner.yml
@@ -73,6 +73,32 @@ jobs:
           unzip -q "$tmp_dir/ossutil.zip" -d "$tmp_dir"
           sudo install -m 0755 "$tmp_dir/ossutil-v1.7.19-linux-amd64/ossutil" 
/usr/local/bin/ossutil
 
+      - name: Install Codex goal binary
+        run: |
+          codex_cmd="$(command -v codex)"
+          codex_target="$(readlink -f "$codex_cmd")"
+          tmp_dir="$(mktemp -d)"
+          trap 'rm -rf "$tmp_dir"' EXIT
+
+          downloaded=false
+          for object in "$OSS_CODEX_GOAL_OBJECT" 
"$OSS_CODEX_GOAL_FALLBACK_OBJECT"; do
+            if ossutil -i "$OSS_AK" -k "$OSS_SK" -e "$OSS_ENDPOINT" cp -f 
"$object" "$tmp_dir/codex-goal"; then
+              downloaded=true
+              break
+            fi
+          done
+          test "$downloaded" = "true"
+          test -s "$tmp_dir/codex-goal"
+          sudo install -m 0755 "$tmp_dir/codex-goal" "$codex_target"
+          "$codex_cmd" exec --help | grep -q -- '--goal'
+          "$codex_cmd" --version
+        env:
+          OSS_AK: ${{ secrets.OSS_AK }}
+          OSS_SK: ${{ secrets.OSS_SK }}
+          OSS_ENDPOINT: oss-cn-hongkong.aliyuncs.com
+          OSS_CODEX_GOAL_OBJECT: oss://doris-community-ci/codex-goal
+          OSS_CODEX_GOAL_FALLBACK_OBJECT: 
oss://doris-community-ci/codex/codex-goal
+
       - name: Configure Codex auth
         run: |
           install -m 700 -d "$RUNNER_TEMP/codex-home"
@@ -241,6 +267,8 @@ jobs:
       - name: Prepare review prompt
         run: |
           cat > "$REVIEW_CONTEXT_DIR/review_prompt.txt" <<'PROMPT'
+          This review task is executed in Codex goal mode. The review goal is 
complete only after every changed file and relevant surrounding code path has 
been examined, every suspicious point has a clear conclusion, and there is no 
remaining possibility of producing any new valuable review comment.
+
           You are performing an automated code review inside a GitHub Actions 
runner. The gh CLI is available and authenticated via GH_TOKEN.
           The current directory is the code repository for the PR to be 
reviewed.
           You MUST NOT attempt to access any files outside the current 
directory. and you DO NOT need to. But this does not prevent you from normally 
using any skill or web fetch tools.
@@ -257,8 +285,29 @@ jobs:
           - User review focus: PLACEHOLDER_CONTEXT_DIR/review_focus.txt
           - PR changed files: PLACEHOLDER_CONTEXT_DIR/pr_changed_files.txt
           - Required AGENTS.md files: 
PLACEHOLDER_CONTEXT_DIR/required_agents.txt
+          - Shared subagent review ledger: 
PLACEHOLDER_CONTEXT_DIR/subagent_review_findings.md
 
           Before reviewing any code, you MUST read and follow the code review 
skill in this repository. During review, you must strictly follow those 
instructions.
+          The active review goal's progress tracking MUST include, and must 
stay current throughout the review:
+          - Read the review prompt, code-review skill, required AGENTS.md 
files, existing review threads, user focus, changed-file list, and the shared 
subagent review ledger.
+          - Spawn the required focused subagents and assign each subagent a 
dedicated section in the shared ledger before it reviews.
+          - Require every subagent to append only to its assigned ledger 
section, while reading the whole ledger to avoid duplicate candidates.
+          - Read the shared ledger after every subagent result, then 
independently verify, deduplicate, accept, or dismiss every candidate in the 
main merged section.
+          - Run at least one final convergence round where every live subagent 
reviews the current ledger and proposed final comment set.
+          - Continue reviewing if any subagent reports a new valuable 
candidate during convergence; repeat verification and convergence until every 
subagent says `NO_NEW_VALUABLE_FINDINGS` for the same current ledger/comment 
set.
+          - Submit the final GitHub review and verify that all accepted 
comments landed before marking the goal complete.
+          The active review goal MUST remain incomplete until every suspicious 
point found during review has a clear conclusion: submitted as an inline issue, 
dismissed as already covered by existing review context, or dismissed with 
concrete code evidence explaining why it is not a bug.
+          You MUST use the available subagent or multi-agent spawn tool for 
this review; do not merely simulate subagent output in your own text. If tool 
discovery is needed, search for the subagent/multi-agent tool first. Spawn at 
least two subagents before final submission:
+          - one subagent assigned to the `optimizer-rewrite` ledger section 
and focused on optimizer/rewrite correctness, semantic equivalence, and 
parallel join/aggregate paths.
+          - one subagent assigned to the `tests-session-config` ledger section 
and focused on regression tests, expected outputs, session/config propagation, 
compatibility, and basic CI/style failures.
+          The shared subagent review ledger is 
PLACEHOLDER_CONTEXT_DIR/subagent_review_findings.md. Before spawning any 
subagent, the main agent MUST read this ledger. Every subagent prompt MUST 
include this ledger path and the exact ledger section assigned to that 
subagent. Subagents must read the whole ledger before reviewing, avoid 
duplicating existing candidates, and append their findings only under their 
assigned subagent section. They must not rewrite the whole ledger, edit another 
[...]
+          Each subagent must record candidate findings in its own ledger 
section with stable IDs, path/line, claim, evidence, duplicate relationship if 
any, and recommendation. If a candidate overlaps an existing ledger item, the 
subagent should add a duplicate note in its own section that references the 
existing candidate ID instead of modifying the existing candidate. This 
section-owned append-only rule is mandatory to avoid concurrent patch conflicts.
+          The main agent must read the shared ledger after each subagent 
result, merge duplicate candidates into the main merged section, update 
candidate statuses, and keep a proposed final comment set in the main-owned 
ledger sections. Statuses must distinguish at least: proposed_by_subagent, 
accepted_for_inline_comment, dismissed_with_evidence, 
duplicate_of_existing_thread, duplicate_of_candidate, and needs_more_evidence.
+          Before final submission, the main agent MUST send the current ledger 
and proposed final comment set to every live subagent and require a convergence 
response. A subagent may either append a new valuable candidate under its 
assigned section or reply exactly `NO_NEW_VALUABLE_FINDINGS` for the current 
ledger/comment set. The main agent MUST NOT complete the goal, final summary, 
or GitHub review until every live subagent has returned 
`NO_NEW_VALUABLE_FINDINGS` in the same convergen [...]
+          Subagents must only report candidate findings with evidence. The 
main agent must read their results and the shared ledger, independently verify 
or dismiss every candidate with concrete code evidence, deduplicate against 
existing review threads, and submit the final GitHub review itself. The final 
summary must include a short "Subagent conclusions" section describing which 
subagent candidates became inline comments, which were dismissed, which 
duplicates were merged, and which c [...]
+          If and only if the runtime exposes no subagent or multi-agent tool 
after explicit discovery, state that limitation in the final summary and 
continue the review manually.
+          You MUST NOT stop after finding the first blocking issue. Keep 
reviewing changed files, related control flow, tests, and parallel/special-case 
paths until all plausible correctness, lifecycle, configuration, compatibility, 
performance, and coverage bugs have been investigated and every bug you can 
substantiate has been reported.
+          Before submitting the final review, do one explicit final sweep over 
the changed-file list and your unresolved candidate list. Only finish when 
there are no unresolved suspicious points and all possible substantiated bugs 
have been pointed out in the GitHub review.
           Before inspecting the PR diff or related code, you MUST read the 
contents of every AGENTS.md file listed below. These paths are computed from 
the PR changed file ancestors in this checkout. Searching for or listing paths 
is not sufficient; read each listed file directly.
           Required AGENTS.md files for this PR:
           PLACEHOLDER_REQUIRED_AGENTS_BLOCK
@@ -297,6 +346,82 @@ jobs:
           required_agents = required_agents_path.read_text().rstrip()
           
prompt_path.write_text(prompt.replace("PLACEHOLDER_REQUIRED_AGENTS_BLOCK", 
required_agents))
           PY
+
+          cat > "$REVIEW_CONTEXT_DIR/subagent_review_findings.md" <<'LEDGER'
+          # Shared Subagent Review Ledger
+
+          This is the shared source of truth for subagent-assisted review.
+
+          Rules:
+          - Subagents must read this whole file before reviewing.
+          - Each subagent may append only to its assigned section under 
`Subagent Candidate Sections`.
+          - Subagents must not rewrite this file, edit another subagent 
section, edit main-owned sections, edit repository source files, or submit 
GitHub comments.
+          - Avoid duplicates. If a candidate overlaps an existing candidate, 
add a duplicate note in your own section that references the existing candidate 
ID.
+          - The main agent owns final status, final deduplication, GitHub 
review submission, and GitHub API verification.
+          - The goal is not complete until all live subagents have reviewed 
the current ledger and proposed final comment set in the same convergence round 
and replied `NO_NEW_VALUABLE_FINDINGS`.
+
+          Candidate statuses:
+          - proposed_by_subagent
+          - accepted_for_inline_comment
+          - dismissed_with_evidence
+          - duplicate_of_existing_thread
+          - duplicate_of_candidate
+          - needs_more_evidence
+
+          Candidate format:
+
+          - ID:
+            Owner:
+            Status:
+            Path:
+            Line:
+            Claim:
+            Evidence:
+            Duplicate relationship:
+            Recommendation:
+
+          ## Subagent Candidate Sections
+
+          ### optimizer-rewrite
+
+          Subagent assignment: optimizer/rewrite correctness, semantic 
equivalence, and parallel join/aggregate paths.
+          Append only below this line and above the next subagent heading.
+
+          ### tests-session-config
+
+          Subagent assignment: regression tests, expected outputs, 
session/config propagation, compatibility, and basic CI/style failures.
+          Append only below this line and above the main-owned sections.
+
+          ## Main Merged Findings
+
+          Main-owned format:
+
+          - ID:
+            Source IDs:
+            Owner: main
+            Status:
+            Path:
+            Line:
+            Claim:
+            Evidence:
+            Duplicate relationship:
+            Main verification:
+            Proposed inline body:
+
+          ## Dismissed Or Duplicate Points
+
+          ## Proposed Final Comment Set
+
+          ## Convergence Rounds
+          LEDGER
+
+          cat > "$REVIEW_CONTEXT_DIR/codex_goal_prompt.txt" <<EOF
+          You are performing an automated code review inside a GitHub Actions 
runner.
+          This invocation is already running in Codex goal mode. Before 
inspecting the PR diff or related code, read 
${REVIEW_CONTEXT_REL}/review_prompt.txt verbatim and follow that file as the 
complete review instruction set.
+          This review must use the available subagent or multi-agent spawn 
tool for focused review passes. All subagents and the main agent must maintain 
${REVIEW_CONTEXT_REL}/subagent_review_findings.md as the shared source of 
truth. Each subagent may append only to its assigned ledger section; the main 
agent owns merged statuses, duplicate suppression, proposed final comments, 
GitHub review submission, and GitHub API verification.
+          The review goal's completion process must include shared-ledger 
review, subagent spawning with assigned ledger sections, candidate 
verification, duplicate merge, a final subagent convergence round, GitHub 
review submission, and GitHub API verification. Do not complete the goal unless 
every live subagent has replied NO_NEW_VALUABLE_FINDINGS for the same current 
ledger/comment set after the last candidate update.
+          Keep reviewing until every suspicious point has a clear conclusion, 
every substantiated issue has been submitted as a GitHub inline review comment, 
and no new valuable review comment remains possible.
+          EOF
         env:
           REPO: ${{ github.repository }}
           PR_NUMBER: ${{ inputs.pr_number }}
@@ -314,13 +439,13 @@ jobs:
           PR_NUMBER: ${{ inputs.pr_number }}
           HEAD_SHA: ${{ inputs.head_sha }}
         run: |
-          PROMPT=$(cat "$REVIEW_CONTEXT_DIR/review_prompt.txt")
+          GOAL_PROMPT="$(cat "$REVIEW_CONTEXT_DIR/codex_goal_prompt.txt")"
           review_started_at="$(date -u +%Y-%m-%dT%H:%M:%SZ)"
 
           set +e
           # GitHub-hosted runners are ephemeral. Avoid workspace-write here 
because
           # Codex uses bubblewrap for that mode and uid maps can be 
unavailable.
-          codex exec "$PROMPT" \
+          codex exec --goal "$GOAL_PROMPT" \
             --cd "$GITHUB_WORKSPACE" \
             --model "gpt-5.5" \
             --config "model_reasoning_effort=\"${REASONING_EFFORT}\"" \
@@ -398,8 +523,8 @@ jobs:
           HELPER_REF: ${{ github.workflow_sha || github.sha }}
           REASONING_EFFORT: xhigh
         run: |
-          if [ ! -s "$REVIEW_CONTEXT_DIR/review_prompt.txt" ] || [ ! -s 
"$REVIEW_CONTEXT_DIR/codex-events.jsonl" ]; then
-            echo "Review prompt or Codex JSONL event stream is missing; 
skipping Litefuse I/O recording."
+          if [ ! -s "$REVIEW_CONTEXT_DIR/codex_goal_prompt.txt" ] || [ ! -s 
"$REVIEW_CONTEXT_DIR/codex-events.jsonl" ]; then
+            echo "Goal prompt or Codex JSONL event stream is missing; skipping 
Litefuse I/O recording."
             exit 0
           fi
 
@@ -411,7 +536,7 @@ jobs:
           chmod 700 "$helper"
 
           python3 "$helper" \
-            --input-file "$REVIEW_CONTEXT_DIR/review_prompt.txt" \
+            --input-file "$REVIEW_CONTEXT_DIR/codex_goal_prompt.txt" \
             --events-file "$REVIEW_CONTEXT_DIR/codex-events.jsonl" \
             --output-file "$REVIEW_CONTEXT_DIR/codex-final-message.txt" \
             --trace-name "doris-ai-review" \


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

(doris) branch master updated: [improvement](ci) Refine Codex review goal completion prompt (#64658)

Reply via email to