feat: LLM-powered PR code review action by monokrome · Pull Request #1 · LimpidTech/codereview

monokrome · 2026-03-14T22:02:13Z

Summary

Composite GitHub Action (macOS runner) that reviews PRs using configurable LLM providers
Posts inline review comments using conventional comment format (nit:, issue:, suggestion:, etc.)
Submits review verdict (APPROVE, REQUEST_CHANGES, COMMENT) based on findings
Starts with Gemini provider, designed for easy addition of others
Includes dogfooding workflow to review its own PRs

Test plan

Verify the review workflow triggers on this PR
Check that inline comments appear with correct conventional labels
Verify the review verdict is appropriate

Composite GitHub Action that reviews pull requests using configurable LLM providers. Posts inline review comments using conventional comment format (nit:, issue:, suggestion:, etc.) and submits a verdict. Starts with Gemini support, designed to add more providers easily. Includes dogfooding workflow to review its own PRs.

LLMs sometimes reference lines outside the diff. Filter those out with warnings instead of aborting the entire review. Also fixes cache-dependency-path for setup-go.

github-actions

Overall, the project is well-structured with good separation of concerns and robust error handling, especially in the filterValidComments function. However, an issue was found with the diff hunk header reconstruction for the LLM prompt, and a few minor suggestions were identified.

github-actions · 2026-03-14T22:05:10Z

.github/workflows/review.yml

+
+jobs:
+  review:
+    runs-on: macos-latest


suggestion: Consider using ubuntu-latest instead of macos-latest for the runs-on property. Ubuntu runners are generally more cost-effective and often faster for Go applications, unless there's a specific macOS requirement not apparent here.

github-actions · 2026-03-14T22:05:10Z

internal/action/inputs.go

+
+	if cfg.Provider == "" {
+		cfg.Provider = "gemini"
+	}


suggestion: The default for provider is already set to gemini in action.yml. This line (if cfg.Provider == "" { cfg.Provider = "gemini" }) duplicates that logic. It's generally better to have such defaults in a single, clear place, like action.yml.

github-actions · 2026-03-14T22:05:10Z

internal/diff/parse.go

+			newFile := File{Path: path}
+			files = append(files, newFile)
+			current = &files[len(files)-1]
+			current.Path = resolvePathFromHeaders(lines, i, path)


question: Is resolvePathFromHeaders strictly necessary? extractPath already appears to correctly derive the new file path from the diff --git line, even in cases of renames (e.g., diff --git a/old b/new). If extractPath is always reliable for the b/ path, this might be redundant, or perhaps it's a defensive fallback for specific diff formats not covered by extractPath?

github-actions · 2026-03-14T22:05:10Z

internal/github/client.go

+		token:      token,
+		httpClient: &http.Client{},
+	}
+}


suggestion: It's good practice to add a timeout to the http.Client for robustness, especially when making external network requests. E.g., Timeout: 30 * time.Second.

github-actions · 2026-03-14T22:05:10Z

internal/prompt/prompt.go

+func Build(files []diff.File, instructions string) (string, string) {
+	var user strings.Builder
+
+	if instructions != "" {


praise: The systemTemplate is exceptionally well-defined. It clearly outlines the rules, expected JSON structure, comment labels, and output format, which is crucial for guiding the LLM's responses effectively.

github-actions · 2026-03-14T22:05:10Z

internal/prompt/prompt.go

+			fmt.Fprintf(&user, "@@ -%d +%d @@\n", h.StartLine, h.StartLine)
+
+			for _, l := range h.Lines {
+				switch l.Kind {


issue: The diff hunk header reconstruction is inaccurate. fmt.Fprintf(&user, "@@ -%d +%d @@\n", h.StartLine, h.StartLine) generates @@ -X +X @@. A correct unified diff hunk header should be @@ -old_start_line,old_num_lines +new_start_line,new_num_lines @@. The diff.Hunk struct currently only stores StartLine (which is the new start line). To accurately reconstruct the diff for the LLM, the diff.Hunk struct needs to be extended to capture the old_start_line, old_num_lines, new_start_line, and new_num_lines from the hunk header during parsing. Providing the LLM with an accurate diff is critical for its ability to understand the context and generate precise comments.

github-actions · 2026-03-14T22:05:10Z

internal/provider/gemini/gemini.go

+			return provider.Response{}, fmt.Errorf("calling Gemini API: %w", err)
+		}
+		defer resp.Body.Close()
+


suggestion: Similar to the GitHub client, consider using a custom http.Client with a timeout for Gemini API calls to prevent requests from hanging indefinitely.

github-actions · 2026-03-14T22:05:10Z

internal/review/review.go

+
+	var valid []Comment
+	for _, c := range comments {
+		if !IsValidLabel(c.Label) {


praise: The filterValidComments function is excellent. It performs robust validation, ensuring that comments have valid labels, positive line numbers, and correctly map to lines in the new file within the provided diff. This is crucial for successful interaction with the GitHub API and preventing invalid comments.

The action now responds when someone replies to a review comment. Fetches the conversation thread, sends it to the LLM with the relevant diff hunk, and posts a reply. Skips its own comments to prevent infinite loops.

github-actions

The overall architecture and implementation are well-structured and logical. The action successfully sets up the Go environment, builds the application, and orchestrates the review process. The diff parsing, GitHub API interaction, and LLM prompting are handled robustly. Several minor issues and suggestions are provided for improvement, primarily around cost optimization, error handling, and prompt clarity.

github-actions · 2026-03-14T22:10:29Z

.github/workflows/review.yml

+  review:
+    runs-on: macos-latest
+    steps:
+      - uses: actions/checkout@v4


suggestion: Using macos-latest for runs-on is quite expensive. For a Go build and a simple action execution, ubuntu-latest should be sufficient and significantly cheaper. Consider changing this unless there's a specific macOS dependency.

github-actions · 2026-03-14T22:10:29Z

action.yml

+    - name: Setup Go
+      uses: actions/setup-go@v5
+      with:
+        go-version: '1.24'


nit: While 1.24 works, it's generally recommended to pin to a specific patch version (e.g., 1.22.x) or just use 1 to avoid unexpected breaking changes if the upstream action updates 1.24 to a future 1.24.x that has issues, though for Go 1.24 is not a strict release yet.

github-actions · 2026-03-14T22:10:29Z

action.yml

+
+    - name: Build
+      shell: bash
+      run: cd ${{ github.action_path }} && go build -o $RUNNER_TEMP/codereview ./cmd/codereview


suggestion: Building into $RUNNER_TEMP/codereview is fine, but it might be slightly cleaner to build directly into the action's path (e.g., cd ${{ github.action_path }} && go build -o codereview ./cmd/codereview) and then run $(github.action_path)/codereview. This avoids potential issues if $RUNNER_TEMP has unexpected contents or permissions, although less likely with GitHub-managed runners.

github-actions · 2026-03-14T22:10:29Z

internal/action/inputs.go

+	Repository struct {
+		Owner struct {
+			Login string `json:"login"`
+		} `json:"owner"`


thought: The githubEvent struct is quite large and includes fields that are directly mapped to Config or CommentContext. This duplication is handled manually. It's functional, but worth noting the manual mapping might become verbose if event structures grow significantly.

github-actions · 2026-03-14T22:10:29Z

internal/diff/parse.go

+
+			i++
+		}
+


thought: The loop j <= start+4 in resolvePathFromHeaders assumes the relevant headers (like +++) will always appear within the first 4 lines after the diff --git line. This is typically true for standard diff formats, but in very unusual cases, it might miss a +++ line if there are many other headers first. It's a pragmatic choice, not necessarily an 'issue'.

github-actions · 2026-03-14T22:10:29Z

internal/github/client.go

+
+	page := 1
+	for {
+		url := fmt.Sprintf("%s/repos/%s/%s/pulls/%d/comments?per_page=100&page=%d", apiBase, owner, repo, prNumber, page)


issue: In FetchCommentThread, the resp.Body.Close() is called after io.ReadAll(resp.Body). If io.ReadAll returns an error, resp.Body.Close() will still be called (deferred). However, the body variable ([]byte) might not contain valid data for the subsequent json.Unmarshal. If io.ReadAll fails, body would be nil or partial, and json.Unmarshal would fail, but the error message would be misleading. It's safer to check err after io.ReadAll before attempting to unmarshal.

github-actions · 2026-03-14T22:10:29Z

internal/prompt/prompt.go

+- If the developer's response resolves your concern, acknowledge it
+- If you still have concerns, explain why clearly
+- Be collaborative, not adversarial
+- Output plain text only, no JSON, no markdown fences`


nit: The prompt rule Output ONLY the JSON object, no markdown fences, no extra text is given. However, review.stripMarkdownFences exists, implying the LLM might still output fences. While the stripping acts as a safeguard, it might be worth refining the prompt to reinforce strict adherence, or acknowledge that the stripping is a necessary robustness measure.

github-actions · 2026-03-14T22:10:29Z

internal/review/review.go

+	for _, f := range files {
+		lines := make(map[int]bool)
+		for _, h := range f.Hunks {
+			for _, l := range h.Lines {


suggestion: Logging warnings to os.Stderr is appropriate, but consider using a structured logger (e.g., log/slog or an external library) for more consistent and potentially more useful output, especially in CI environments where these warnings might be parsed by other tools. For a simple action, fmt.Fprintf is acceptable.

On synchronize events, fetch the bot's existing review comments and pass them to the LLM as context so it doesn't repeat itself. Also refactors duplicated struct definitions in the GitHub client.

github-actions

The PR introduces a new 'reply' mode and significant refactoring to support it, which is well-structured. However, several previously identified issues and suggestions remain unaddressed, most notably the critical bug related to inaccurate diff hunk header reconstruction for the LLM.

github-actions · 2026-03-14T22:14:35Z

.github/workflows/review.yml

+    steps:
+      - uses: actions/checkout@v4
+
+      - uses: ./


suggestion: The runs-on: macos-latest is still present. As noted previously, ubuntu-latest is generally more cost-effective and often faster for Go applications unless there's a specific macOS requirement not apparent here.

github-actions · 2026-03-14T22:14:35Z

.github/workflows/review.yml

+    types: [opened, synchronize]
+  pull_request_review_comment:
+    types: [created]
+


praise: The addition of pull_request_review_comment: types: [created] to the on trigger is crucial for the new reply functionality and is well-implemented.

github-actions · 2026-03-14T22:14:35Z

.gitignore

@@ -4,7 +4,7 @@
 *.dll
 *.so


nit: Changing codereview to /codereview makes the exclusion more specific to the root binary, which is a good minor improvement.

github-actions · 2026-03-14T22:14:35Z

.projections.json

@@ -0,0 +1,21 @@
+{


praise: This new .projections.json file provides useful configuration for development environments and tooling, improving the developer experience.

github-actions · 2026-03-14T22:14:35Z

action.yml

+        go-version: '1.24'
+        cache-dependency-path: go.sum
+
+    - name: Build


nit: The go-version: '1.24' remains. While 1.24 is fine, it's generally safer to pin to a specific patch version (e.g., 1.22.x) or just use 1 for stability, although 1.24 isn't a stable release yet, so it's less critical here.

github-actions · 2026-03-14T22:14:35Z

internal/prompt/prompt.go

+
+Rules:
+- Be concise and directly address the question or comment
+- Reference the code when relevant


praise: The inclusion of prior comments in the user prompt is a significant improvement, helping the LLM avoid repetition and focus on new feedback or changes addressing previous comments.

github-actions · 2026-03-14T22:14:35Z

internal/prompt/prompt.go

+				switch l.Kind {
+				case diff.KindAdded:
+					fmt.Fprintf(&user, "+%s\n", l.Content)
+				case diff.KindRemoved:


praise: The new replySystemTemplate is clear and concise, providing excellent guidance for the LLM when generating replies within a conversation thread.

github-actions · 2026-03-14T22:14:35Z

internal/provider/gemini/gemini.go

+			return provider.Response{}, fmt.Errorf("calling Gemini API: %w", err)
+		}
+		defer resp.Body.Close()
+


suggestion: Similar to the GitHub client, using http.DefaultClient.Do(httpReq) means there's no timeout configured for Gemini API calls. Consider using a custom http.Client with a timeout to prevent requests from hanging indefinitely, improving the action's robustness.

github-actions · 2026-03-14T22:14:35Z

internal/provider/provider.go

@@ -0,0 +1,14 @@
+package provider


praise: Introducing the provider interface and associated types (Request, Response, ReviewFunc) is a great abstraction, making the codebase more modular and easier to extend with other LLM providers in the future.

github-actions · 2026-03-14T22:14:35Z

internal/review/review.go

+
+	result, err := ParseResponse(resp.Content)
+	if err != nil {
+		return Result{}, fmt.Errorf("parsing response: %w", err)


praise: The RunReply function correctly orchestrates the reply workflow, building the prompt, calling the provider, and processing the response.

On review, fetch the complete contents of each changed file so the LLM can understand surrounding code, not just the diff. Files over 256KB are skipped silently. Deleted files return empty and are also skipped.

github-actions

Key issues related to diff parsing for the LLM and HTTP client robustness remain unaddressed. Please review the highlighted issues.

github-actions · 2026-03-14T22:17:08Z

.github/workflows/review.yml

+  review:
+    runs-on: macos-latest
+    steps:
+      - uses: actions/checkout@v4


suggestion: The runs-on: macos-latest is still present. As noted previously, ubuntu-latest is generally more cost-effective and often faster for Go applications unless there's a specific macOS requirement not apparent here.

github-actions · 2026-03-14T22:17:08Z

.gitignore

 *.so
 *.dylib
-codereview
+/codereview


nit: Changing codereview to /codereview makes the exclusion more specific to the root binary, which is a good minor improvement.

github-actions · 2026-03-14T22:17:08Z

action.yml

+      with:
+        go-version: '1.24'
+        cache-dependency-path: go.sum
+


nit: The go-version: '1.24' remains. While 1.24 is fine, it's generally safer to pin to a specific patch version (e.g., 1.22.x) or just use 1 for stability. As 1.24 is not a stable release yet, this is less critical, but still a general best practice.

github-actions · 2026-03-14T22:17:08Z

internal/action/inputs.go

+
+	var event githubEvent
+	if err := json.Unmarshal(data, &event); err != nil {
+		return Config{}, fmt.Errorf("parsing event JSON: %w", err)


suggestion: This line if cfg.Provider == "" { cfg.Provider = "gemini" } still duplicates the default provider: 'gemini' already set in action.yml. It's generally better to maintain defaults in a single, authoritative place to avoid inconsistencies.

github-actions · 2026-03-14T22:17:08Z

internal/diff/types.go

+}
+
+type File struct {
+	Path  string


issue: The diff.Hunk struct still only contains StartLine int, which represents the new start line. To accurately reconstruct a unified diff hunk header for the LLM (e.g., @@ -old_start_line,old_num_lines +new_start_line,new_num_lines @@), the Hunk struct must be extended to capture OldStartLine, OldNumLines, NewStartLine (which is current StartLine), and NewNumLines during parsing. This is critical for the LLM to precisely understand the diff context.

github-actions · 2026-03-14T22:17:08Z

internal/github/client.go

+
+func New(token string) *Client {
+	return &Client{
+		token:      token,


suggestion: The http.Client created in New still doesn't have a timeout. It's good practice to add a timeout (e.g., Timeout: 30 * time.Second) for robustness when making external network requests.

github-actions · 2026-03-14T22:17:08Z

internal/github/client.go

+		req.Header.Set("Authorization", "Bearer "+c.token)
+		req.Header.Set("Accept", "application/vnd.github.v3+json")
+
+		resp, err := c.httpClient.Do(req)


issue: In fetchAllPRComments, resp.Body.Close() is called after io.ReadAll(resp.Body). If io.ReadAll returns an error, body might be partial or invalid, and json.Unmarshal would receive bad data. It's safer to defer resp.Body.Close() immediately after c.httpClient.Do(req) to ensure it's called reliably, and then check the error from io.ReadAll before attempting to unmarshal the body or check resp.StatusCode.

github-actions · 2026-03-14T22:17:08Z

internal/prompt/prompt.go

+				}
+			}
+		}
+	}


issue: This is directly related to the internal/diff/types.go issue. The hunk header is still being reconstructed incorrectly as @@ -%d +%d @@. A correct unified diff hunk header requires old/new start lines and number of lines (e.g., @@ -old_start,old_num +new_start,new_num @@). This inaccurate format will hinder the LLM's ability to understand the exact context of the changes, leading to less precise comments. This needs to be corrected by using the full hunk header information, which requires updates to the diff.Hunk struct and its parsing logic.

github-actions · 2026-03-14T22:17:08Z

internal/provider/gemini/gemini.go

+			return provider.Response{}, fmt.Errorf("calling Gemini API: %w", err)
+		}
+		defer resp.Body.Close()
+


suggestion: Similar to the GitHub client, using http.DefaultClient.Do(httpReq) means there's no timeout configured for Gemini API calls. Consider using a custom http.Client with a timeout to prevent requests from hanging indefinitely, improving the action's robustness.

The LLM now returns structured replies indicating whether the concern is resolved. If resolved and the thread was started by the bot, it calls the GitHub GraphQL API to resolve the thread automatically.

github-actions

Multiple previously raised issues and suggestions remain unaddressed, most critically the inaccurate diff hunk header generation and HTTP client error handling. Please review the comments and make the necessary changes.

github-actions · 2026-03-14T22:19:58Z

.github/workflows/review.yml

+  review:
+    runs-on: macos-latest
+    steps:
+      - uses: actions/checkout@v4


suggestion: The runs-on: macos-latest is still present. As noted previously, ubuntu-latest is generally more cost-effective and often faster for Go applications unless there's a specific macOS requirement not apparent here.

github-actions · 2026-03-14T22:19:58Z

action.yml

+      with:
+        go-version: '1.24'
+        cache-dependency-path: go.sum
+


nit: The go-version: '1.24' remains. While 1.24 is fine, it's generally safer to pin to a specific patch version (e.g., 1.22.x) or just use 1 for stability. As 1.24 is not a stable release yet, this is less critical, but still a general best practice.

github-actions · 2026-03-14T22:19:58Z

internal/action/inputs.go

+
+	if cfg.Provider == "" {
+		cfg.Provider = "gemini"
+	}


suggestion: This line if cfg.Provider == "" { cfg.Provider = "gemini" } still duplicates the default provider: 'gemini' already set in action.yml. It's generally better to maintain defaults in a single, authoritative place to avoid inconsistencies.

github-actions · 2026-03-14T22:19:58Z

internal/diff/types.go

+	StartLine int
+	Lines     []Line
+}
+


issue: This is a re-highlight of a critical, unaddressed issue. The diff.Hunk struct still only contains StartLine int, which represents the new start line. To accurately reconstruct a unified diff hunk header for the LLM (e.g., @@ -old_start_line,old_num_lines +new_start_line,new_num_lines @@), the Hunk struct must be extended to capture OldStartLine, OldNumLines, NewStartLine (which is current StartLine), and NewNumLines during parsing. This is critical for the LLM to precisely understand the diff context.

github-actions · 2026-03-14T22:19:58Z

internal/github/client.go

+	return &Client{
+		token:      token,
+		httpClient: &http.Client{},
+	}


suggestion: The http.Client created in New still doesn't have a timeout. It's good practice to add a timeout (e.g., Timeout: 30 * time.Second) for robustness when making external network requests.

github-actions · 2026-03-14T22:19:58Z

internal/github/client.go

+
+		body, err := io.ReadAll(resp.Body)
+		resp.Body.Close()
+


issue: This is a re-highlight of a previously identified problem. In fetchAllPRComments, resp.Body.Close() is called after io.ReadAll(resp.Body). If io.ReadAll returns an error (e.g., due to network issues while reading a large body), body might be partial or invalid, and json.Unmarshal would receive bad data. It's safer to defer resp.Body.Close() immediately after c.httpClient.Do(req) to ensure it's called reliably, and then check the error from io.ReadAll before attempting to unmarshal the body or check resp.StatusCode.

github-actions · 2026-03-14T22:19:58Z

internal/provider/gemini/gemini.go

+
+		if resp.StatusCode != http.StatusOK {
+			return provider.Response{}, fmt.Errorf("Gemini API returned status %d: %s", resp.StatusCode, string(respBody))
+		}


suggestion: Similar to the GitHub client, using http.DefaultClient.Do(httpReq) means there's no timeout configured for Gemini API calls. Consider using a custom http.Client with a timeout to prevent requests from hanging indefinitely, improving the action's robustness.

monokrome added 2 commits March 14, 2026 12:47

fix: drop invalid LLM comments instead of failing the review

560afe2

LLMs sometimes reference lines outside the diff. Filter those out with warnings instead of aborting the entire review. Also fixes cache-dependency-path for setup-go.