Added support to download binary data from result grid. #4011 #9645
pravesh-sharma wants to merge 4 commits into pgadmin-org:master
Conversation
Force-pushed from cbc7383 to 5326bda
Walkthrough

This pull request adds functionality to download binary data from SQL editor result grids. Changes include a new user preference, a backend endpoint for streaming binary data, frontend UI components with download buttons, event handling for triggering downloads, and database typecaster utilities for binary data conversion.
Sequence Diagram

```mermaid
sequenceDiagram
    participant User
    participant BinaryFormatter
    participant EventBus
    participant ResultSet
    participant FrontendAPI as Frontend API
    participant SQLEditorBackend
    participant Database
    User->>BinaryFormatter: Click download button
    BinaryFormatter->>EventBus: Fire TRIGGER_SAVE_BINARY_DATA<br/>(rowPos, colPos)
    EventBus->>ResultSet: Event triggered
    ResultSet->>ResultSet: Generate timestamped filename<br/>Show download progress
    ResultSet->>FrontendAPI: POST /download_binary_data<br/>(trans_id, rowPos, colPos)
    FrontendAPI->>SQLEditorBackend: HTTP POST request
    SQLEditorBackend->>SQLEditorBackend: Validate transaction state<br/>Register binary typecasters
    SQLEditorBackend->>Database: Fetch cell data at position
    Database->>SQLEditorBackend: Return binary data
    SQLEditorBackend->>FrontendAPI: Stream as octet-stream attachment
    FrontendAPI->>ResultSet: Receive binary stream
    ResultSet->>User: Download file to disk<br/>Clear progress indicator
```
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~20 minutes
🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (1 warning)
Actionable comments posted: 3
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@web/pgadmin/tools/sqleditor/__init__.py`:
- Around line 2200-2209: The code dereferences conn._Connection__async_cursor
before verifying the results of check_transaction_status, which can return a
Response or a None conn; update the logic in the block that calls
check_transaction_status(trans_id) to first check if the returned value is a
Response and return it directly, then verify conn, trans_obj and session_obj are
not None before accessing conn._Connection__async_cursor; only after those
guards call register_binary_data_typecasters(cur). Ensure you stop ignoring
error_msg by either logging/including it in the returned internal_server_error
call (using TRANSACTION_STATUS_CHECK_FAILED as context) so Ruff's
unused-variable warning is resolved and real transaction errors are surfaced.
- Around line 2220-2223: The code uses cursor.scroll in relative mode and
doesn't validate inputs, so fix by parsing and validating data['rowpos'] and
data['colpos'] (ensure they are integers >= 0), call cur.scroll(int_rowpos,
mode='absolute') to move to the exact row, call cur.fetchone() and check for
None (row out of range), then verify the returned row has enough columns before
accessing binary_data[col_pos]; on any invalid input or out-of-range access
raise/return a proper validation response instead of letting
IndexError/ValueError bubble up; wrap the parsing/validation and fetch in a
try/except to return a controlled error for bad inputs.
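The validation order described above can be sketched as a pure function. The cursor is modeled here as a plain sequence of rows so the sketch runs standalone; the real handler would call `cur.scroll(row_pos, mode='absolute')` and `cur.fetchone()` instead of indexing, and the helper name is hypothetical:

```python
def fetch_cell(rows, data):
    """Validate rowpos/colpos and return one cell from a row sequence.

    Raises ValueError for any bad input so the caller can map it to a
    controlled error response instead of letting IndexError/TypeError
    bubble up from the cursor.
    """
    try:
        row_pos = int(data['rowpos'])
        col_pos = int(data['colpos'])
    except (KeyError, TypeError, ValueError):
        raise ValueError('rowpos/colpos must be integers')
    if row_pos < 0 or col_pos < 0:
        raise ValueError('rowpos/colpos must be non-negative')
    if row_pos >= len(rows):
        # Equivalent to cur.fetchone() returning None after an
        # out-of-range absolute scroll.
        raise ValueError('requested row is out of range')
    row = rows[row_pos]
    if col_pos >= len(row):
        raise ValueError('requested column is out of range')
    return row[col_pos]
```

The point of the ordering is that every failure mode (non-integer, negative, past-the-end row, past-the-end column) surfaces as the same controlled exception type.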
In
`@web/pgadmin/tools/sqleditor/static/js/components/QueryToolDataGrid/Formatters.jsx`:
- Around line 76-85: BinaryFormatter is using the hard-coded row.__temp_PK which
can be undefined when the server shifts the key (e.g. __temp_PK1); instead
locate the actual client-PK key on the row object and pass its value to
eventBus.fireEvent for QUERY_TOOL_EVENTS.TRIGGER_SAVE_BINARY_DATA. Change the
onClick handler in BinaryFormatter to compute the client PK key by scanning
Object.keys(row) for the property that startsWith('__temp_PK') (or otherwise
matches the server-generated client-PK name used elsewhere in ResultSet.jsx),
read row[thatKey], and use that value instead of row.__temp_PK when firing the
event; keep the rest of the component and props (row, column, eventBus,
downloadBinaryData) unchanged.
Force-pushed from bcef5c9 to e1ead7e
Actionable comments posted: 3
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@web/pgadmin/tools/sqleditor/__init__.py`:
- Around line 2253-2254: Remove the trailing whitespace at the end of the line
containing the return call to send_file in __init__.py (the line starting with
"return send_file(") to fix the pycodestyle W291; edit the return send_file(
call so there is no space before the newline (e.g., change "return send_file(␣"
to "return send_file(") and re-run linters to confirm the warning is resolved.
- Around line 2200-2212: The current handling after
check_transaction_status(trans_id) treats any error_msg as a 500 and may
double-wrap Response objects; instead, inspect the returned error_msg from
check_transaction_status (e.g., ERROR_MSG_TRANS_ID_NOT_FOUND) and handle it
explicitly: if error_msg equals the "not found" constant return a 404 Response
(do not call internal_server_error), if error_msg is already a Response return
it directly, otherwise fall back to internal_server_error(errormsg=error_msg);
keep the subsequent guard that returns
internal_server_error(TRANSACTION_STATUS_CHECK_FAILED) only for missing/invalid
status/conn/trans_obj/session_obj as before.
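The branching above can be sketched as a small helper. The `Response` stand-in and the exact text of `ERROR_MSG_TRANS_ID_NOT_FOUND` are placeholders for the real Flask/pgAdmin objects, and the helper returns `(status_code, payload)` pairs instead of real responses so the guard order is easy to see:

```python
ERROR_MSG_TRANS_ID_NOT_FOUND = 'Transaction ID not found in the session.'


class Response:
    # Minimal stand-in for flask.Response, used only in this sketch.
    def __init__(self, status=200):
        self.status = status


def classify_transaction_error(error_msg):
    """Map check_transaction_status()'s error_msg to an HTTP outcome.

    Guard order: no error -> proceed; a ready-made Response -> pass it
    through untouched; the "not found" constant -> 404; anything else
    -> 500 with the message included, so real errors are surfaced.
    """
    if error_msg is None:
        return 200, None
    if isinstance(error_msg, Response):
        return error_msg.status, error_msg
    if error_msg == ERROR_MSG_TRANS_ID_NOT_FOUND:
        return 404, error_msg
    return 500, error_msg
```

Passing the Response through before comparing against the string constant matters: comparing a Response to a string is always False, so reversing the checks would silently double-wrap it.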
In
`@web/pgadmin/tools/sqleditor/static/js/components/QueryToolDataGrid/Formatters.jsx`:
- Around line 76-86: BinaryFormatter currently passes a page-relative
props.rowIdx into eventBus.fireEvent for
QUERY_TOOL_EVENTS.TRIGGER_SAVE_BINARY_DATA, which breaks downloads when
pagination is present; update BinaryFormatter to get pagination base from
DataGridExtrasContext (or use dataGridExtras?.startRowNum) and compute an
absolute index as props.rowIdx + (startRowNum ?? 1) - 1, then pass that absolute
index to eventBus.fireEvent instead of props.rowIdx so the saved binary data
targets the correct row.
---
Duplicate comments:
In `@web/pgadmin/tools/sqleditor/__init__.py`:
- Around line 2230-2247: The code advances the DB cursor before validating
inputs and then scrolls again, which can skip rows; remove the premature
cur.scroll(int(data['rowpos'])) and the initial cur.fetchone() usage, validate
rowpos/colpos first (ensure they are ints and non-negative), then call
cur.scroll(row_pos, mode='absolute') to move to the absolute row and fetch with
cur.fetchone(), check that row is not None and col_pos < len(row) before
accessing row[col_pos], and keep the existing exception handling
(ValueError/IndexError/TypeError) with the internal_server_error/gettext path
unchanged.
Actionable comments posted: 6
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@web/pgadmin/tools/sqleditor/__init__.py`:
- Around line 2226-2234: The error message returned when request data is missing
is incorrect/copy-pasted; update the gettext string in the block that sets data
= request.values if request.values else request.get_json(silent=True) so the
make_json_response call (success=0, status=410) references the expected binary
data parameters rather than "query" — e.g. change the gettext text to mention
the required binary payload/parameter names (for example "file", "binary" or
whatever this endpoint expects) so callers know which binary parameters are
missing; keep the existing use of gettext and make_json_response.
- Around line 2258-2263: The code calls send_file(BytesIO(binary_data), ...)
where binary_data comes from row[col_pos]; if the cell is SQL NULL (binary_data
is None) BytesIO(None) raises TypeError outside the try/except, so before
constructing BytesIO check whether binary_data is None (or empty) and return a
user-friendly error response (e.g., an HTTP error/flash/message) instead of
attempting to call BytesIO; update the branch around binary_data/row[col_pos]
that calls send_file and ensure send_file(BytesIO(...)) is only executed when
binary_data is not None, referencing the existing variables binary_data,
row[col_pos], BytesIO and send_file.
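A minimal sketch of that NULL guard, kept separate from Flask so it is easy to exercise (the helper name is hypothetical; only `BytesIO` is the real stdlib object used by the handler):

```python
from io import BytesIO


def prepare_download_stream(binary_data):
    """Return a BytesIO ready for send_file(), or None for SQL NULL cells.

    BytesIO(None) raises TypeError, so the None check must happen before
    the buffer is constructed; the caller maps None to a user-friendly
    error response instead of a 500.
    """
    if binary_data is None:
        return None
    return BytesIO(bytes(binary_data))
```

Coercing through `bytes(...)` also covers `memoryview`/`bytearray` values, which `BytesIO` would otherwise copy-construct with slightly different semantics.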
- Around line 2235-2238: Remove the unvalidated duplicate cursor operations:
delete the four lines that set col_pos = data['colpos'], call
cur.scroll(int(data['rowpos'])), assign binary_data = cur.fetchone(), and then
binary_data = binary_data[col_pos]; these are run before the existing try/except
and can raise KeyError or use an uncast col_pos and corrupt cursor state—keep
only the validated logic already present in the try/except that performs
cur.scroll, cur.fetchone and integer casting of col_pos.
- Around line 2219-2224: The call to register_binary_data_typecasters(cur)
mutates adapters on the shared cursor conn._Connection__async_cursor
(registering ByteaBinaryDataLoader for OIDs 17 and 1001) and breaks subsequent
grid fetches; fix by creating and using a dedicated cursor for the
binary-download path instead of mutating the shared cursor — e.g., obtain a new
cursor (not conn._Connection__async_cursor), call
register_binary_data_typecasters(new_cur) and perform the binary fetch/streaming
on new_cur, then close it; alternatively, if creating a separate cursor is
infeasible, capture the original typecasters from the shared cursor, call
register_binary_data_typecasters(cur) for the binary read, then restore the
saved adapters immediately after the download completes.
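The save-and-restore alternative boils down to a try/finally around the adapter mutation. Here is a runnable sketch with the per-cursor loader registry modeled as a plain dict (OIDs 17 and 1001 are bytea and bytea[], as noted above; the real registry lives inside psycopg's adapter machinery, so treat this as an illustration of the pattern only):

```python
from contextlib import contextmanager


@contextmanager
def temporary_typecasters(adapters, overrides):
    """Apply typecaster overrides to a shared registry, then restore it.

    `adapters` stands in for the per-cursor loader registry that
    register_binary_data_typecasters() mutates; `overrides` maps OIDs to
    the loaders wanted for the duration of the binary download.
    """
    saved = {oid: adapters.get(oid) for oid in overrides}
    adapters.update(overrides)
    try:
        yield adapters
    finally:
        # Restore exactly what was there before, removing keys we added.
        for oid, loader in saved.items():
            if loader is None:
                adapters.pop(oid, None)
            else:
                adapters[oid] = loader
```

Because the restore runs in `finally`, the shared cursor's typecasters come back even if the download raises mid-stream, which is the failure mode that breaks subsequent grid fetches.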
In
`@web/pgadmin/tools/sqleditor/static/js/components/QueryToolDataGrid/Formatters.jsx`:
- Line 87: Remove the stray debug console.log call in Formatters.jsx: delete the
line "console.log(absoluteRowPos)" (it triggers ESLint no-console and semi
errors); if runtime debugging is required instead use the project's logger API
or wrap in a debug-only conditional, but the immediate fix is to remove that
console.log reference to absoluteRowPos so the file passes linting.
- Around line 83-86: clientPK is being assigned sequentially in processRows
using clientPKLastIndex without accounting for the fetchWindow pagination
offset, so rows re-fetched on earlier pages get wrong absolute positions; update
processRows to compute clientPK from the window offset (e.g., clientPK =
(fetchWindowOffset || 0) + localIndex) or set clientPKLastIndex to the current
page start before incrementing, and when reading absoluteRowPos use
parseInt(dataGridExtras?.rowKeyGetter?.(row) ?? 0, 10) to include the radix;
locate and change the code in processRows and the absoluteRowPos calculation to
apply the pagination offset (use the fetchWindow/offset value available there or
from dataGridExtras) and add the radix argument to parseInt.
Actionable comments posted: 2
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@web/pgadmin/tools/sqleditor/__init__.py`:
- Around line 2219-2257: The handler mutates the live async result cursor
conn._Connection__async_cursor by calling cur.scroll(...) and cur.fetchone(), so
preserve/restore the cursor position (or use a separate cursor) to avoid
changing shared state: either capture the current position (e.g., cur.rownumber
or equivalent on the cursor) before calling cur.scroll/cur.fetchone and then
restore it with cur.scroll(...) in the finally block, or perform the binary read
using a new cursor from conn (e.g., conn.cursor()) so
register_binary_typecasters, cur.scroll and cur.fetchone operate on an isolated
cursor and leave conn._Connection__async_cursor unchanged.
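A sketch of the save/restore variant, using a toy list-backed cursor so it runs standalone (a real psycopg cursor exposes `rownumber` and `scroll(..., mode='absolute')` with these semantics; the toy class exists only to make the pattern testable):

```python
class ToyCursor:
    """List-backed stand-in for the shared async result cursor."""

    def __init__(self, rows):
        self._rows = rows
        self.rownumber = 0

    def scroll(self, pos, mode='absolute'):
        self.rownumber = pos if mode == 'absolute' else self.rownumber + pos

    def fetchone(self):
        if self.rownumber >= len(self._rows):
            return None
        row = self._rows[self.rownumber]
        self.rownumber += 1
        return row


def read_cell_preserving_position(cur, row_pos, col_pos):
    """Read one cell, then put the shared cursor back where it was."""
    saved = cur.rownumber  # remember where the grid left the cursor
    try:
        cur.scroll(row_pos, mode='absolute')
        row = cur.fetchone()
        return None if row is None else row[col_pos]
    finally:
        # Restore shared state even if the read raised.
        cur.scroll(saved, mode='absolute')
```

The alternative named in the comment, opening a fresh cursor via `conn.cursor()` just for the download, avoids the restore entirely at the cost of a second server-side fetch.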
In `@web/pgadmin/utils/driver/psycopg3/typecast.py`:
- Around line 258-274: ByteaDataLoader.load currently only hex-decodes
memoryview and can return str on fallback; change load to normalize memoryview,
bytes/bytearray and str to a bytes object and always return bytes or None: for
memoryview/bytes/bytearray call b = bytes(data); if b.startswith(b'\\x') strip
the initial b'\\x', decode the remainder as ASCII hex and return
bytes.fromhex(...), catching ValueError and returning the original bytes on
error; for str convert to bytes (e.g. via encode('utf-8')) first and apply the
same logic; ensure the final return type is bytes or None and update
ByteaDataLoader.load accordingly.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
Run ID: 84603580-642a-4e56-af20-c4806639ac2b
📒 Files selected for processing (7)
- docs/en_US/preferences.rst
- web/pgadmin/misc/__init__.py
- web/pgadmin/tools/sqleditor/__init__.py
- web/pgadmin/tools/sqleditor/static/js/components/QueryToolConstants.js
- web/pgadmin/tools/sqleditor/static/js/components/QueryToolDataGrid/Formatters.jsx
- web/pgadmin/tools/sqleditor/static/js/components/sections/ResultSet.jsx
- web/pgadmin/utils/driver/psycopg3/typecast.py
🚧 Files skipped from review as they are similar to previous changes (3)
- web/pgadmin/tools/sqleditor/static/js/components/sections/ResultSet.jsx
- docs/en_US/preferences.rst
- web/pgadmin/misc/__init__.py
```python
cur = conn._Connection__async_cursor
if cur is None:
    return internal_server_error(
        errormsg=gettext('No active result cursor.')
    )
register_binary_data_typecasters(cur)

data = request.values if request.values else request.get_json(silent=True)
if data is None:
    return make_json_response(
        status=410,
        success=0,
        errormsg=gettext(
            "Could not find the required parameters (rowpos, colpos)."
        )
    )

try:
    row_pos = int(data['rowpos'])
    col_pos = int(data['colpos'])
    if row_pos < 0 or col_pos < 0:
        raise ValueError
    cur.scroll(row_pos)
    row = cur.fetchone()
    if row is None or col_pos >= len(row):
        return internal_server_error(
            errormsg=gettext('Requested cell is out of range.')
        )
    binary_data = row[col_pos]
except (ValueError, IndexError, TypeError) as e:
    current_app.logger.error(e)
    return internal_server_error(
        errormsg='Invalid row/column position.'
    )
finally:
    # Always restore the original typecasters
    # (works on connection or cursor)
    register_binary_typecasters(cur)
```
Preserve the async cursor position around the binary read.
This handler scrolls and reads from conn._Connection__async_cursor, which is the live result-set cursor backing the query tool. After cur.scroll(row_pos) and cur.fetchone(), the cursor is left advanced to the downloaded row, so later operations that reuse cursor state can continue from the wrong place. Please save/restore the cursor position here, or do the binary fetch on a separate cursor.
🧰 Tools
🪛 Ruff (0.15.4)
[warning] 2240-2240: Abstract raise to an inner function (TRY301)
[warning] 2249-2249: Use logging.exception instead of logging.error; replace with exception (TRY400)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@web/pgadmin/tools/sqleditor/__init__.py` around lines 2219 - 2257, The
handler mutates the live async result cursor conn._Connection__async_cursor by
calling cur.scroll(...) and cur.fetchone(), so preserve/restore the cursor
position (or use a separate cursor) to avoid changing shared state: either
capture the current position (e.g., cur.rownumber or equivalent on the cursor)
before calling cur.scroll/cur.fetchone and then restore it with cur.scroll(...)
in the finally block, or perform the binary read using a new cursor from conn
(e.g., conn.cursor()) so register_binary_typecasters, cur.scroll and
cur.fetchone operate on an isolated cursor and leave
conn._Connection__async_cursor unchanged.
```python
class ByteaDataLoader(Loader):
    # Loads the actual binary data.
    def load(self, data):
        if data:
            if isinstance(data, memoryview):
                data = bytes(data).decode()
                if data.startswith('\\x'):
                    data = data[2:]
                try:
                    return bytes.fromhex(data)
                except ValueError:
                    # In case of error while converting hex to bytes, return
                    # original data.
                    return data
            else:
                return data
        return data if data is not None else None
```
ByteaDataLoader doesn’t consistently return decoded bytes.
The hex-decoding path only runs for memoryview. If psycopg hands this loader a plain bytes buffer, the code returns the textual bytea representation unchanged, so the download endpoint can stream b'\\x...' instead of the original payload. On the fallback path it can also return a str, which is incompatible with BytesIO. Please normalize both bytes and memoryview through the same conversion path and keep the return type bytes | None.
🔧 Suggested direction

```diff
 class ByteaDataLoader(Loader):
     # Loads the actual binary data.
     def load(self, data):
-        if data:
-            if isinstance(data, memoryview):
-                data = bytes(data).decode()
-                if data.startswith('\\x'):
-                    data = data[2:]
-                try:
-                    return bytes.fromhex(data)
-                except ValueError:
-                    # In case of error while converting hex to bytes, return
-                    # original data.
-                    return data
-            else:
-                return data
-        return data if data is not None else None
+        if data is None:
+            return None
+
+        raw = bytes(data) if isinstance(data, memoryview) else data
+        if isinstance(raw, bytes) and raw.startswith(b'\\x'):
+            return bytes.fromhex(raw[2:].decode())
+
+        return raw
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@web/pgadmin/utils/driver/psycopg3/typecast.py` around lines 258 - 274,
ByteaDataLoader.load currently only hex-decodes memoryview and can return str on
fallback; change load to normalize memoryview, bytes/bytearray and str to a
bytes object and always return bytes or None: for memoryview/bytes/bytearray
call b = bytes(data); if b.startswith(b'\\x') strip the initial b'\\x', decode
the remainder as ASCII hex and return bytes.fromhex(...), catching ValueError
and returning the original bytes on error; for str convert to bytes (e.g. via
encode('utf-8')) first and apply the same logic; ensure the final return type is
bytes or None and update ByteaDataLoader.load accordingly.
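A standalone version of the normalization described above (the helper name is hypothetical; note that the `b'\\x'` prefix check is a heuristic: a raw payload that genuinely begins with the bytes 0x5c 0x78 followed by valid hex would be mis-decoded, which is a limitation the loader inherits from the textual bytea format):

```python
def normalize_bytea(data):
    """Normalize psycopg bytea output to bytes, or None for SQL NULL.

    Accepts memoryview, bytes/bytearray or str; hex-escaped values
    ('\\x...') are decoded, anything else is returned as raw bytes.
    """
    if data is None:
        return None
    if isinstance(data, str):
        data = data.encode('utf-8')
    b = bytes(data)  # covers memoryview and bytearray too
    if b.startswith(b'\\x'):
        try:
            return bytes.fromhex(b[2:].decode('ascii'))
        except (ValueError, UnicodeDecodeError):
            return b  # not valid hex after all: return the original bytes
    return b
```

Every path ends in `bytes` or `None`, so the download endpoint can hand the result straight to `BytesIO` without the str-vs-bytes mismatch the review flagged.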
Summary by CodeRabbit
New Features
Documentation