mirror of
https://github.com/postgres/postgres.git
synced 2025-05-02 11:44:50 +03:00
Attempting such an operation would already fail, but in various and confusing ways. For example, while in recovery, some elog() messages would be reported, but these should never be user-facing. This commit restricts any write operations done on large objects in a read-only context, so as the errors generated are more user-friendly. This is per the discussion done with Tom Lane and Robert Haas. Some regression tests are added to check the case of all the SQL functions working on large objects (including an update of the test's alternate output). Author: Yugo Nagata Discussion: https://postgr.es/m/20220527153028.61a4608f66abcd026fd3806f@sraoss.co.jp
999 lines
34 KiB
Plaintext
999 lines
34 KiB
Plaintext
<!-- doc/src/sgml/lobj.sgml -->
|
|
|
|
<chapter id="largeobjects">
|
|
<title>Large Objects</title>
|
|
|
|
<indexterm zone="largeobjects"><primary>large object</primary></indexterm>
|
|
<indexterm><primary>BLOB</primary><see>large object</see></indexterm>
|
|
|
|
<para>
|
|
<productname>PostgreSQL</productname> has a <firstterm>large object</firstterm>
|
|
facility, which provides stream-style access to user data that is stored
|
|
in a special large-object structure. Streaming access is useful
|
|
when working with data values that are too large to manipulate
|
|
conveniently as a whole.
|
|
</para>
|
|
|
|
<para>
|
|
This chapter describes the implementation and the programming and
|
|
query language interfaces to <productname>PostgreSQL</productname>
|
|
large object data. We use the <application>libpq</application> C
|
|
library for the examples in this chapter, but most programming
|
|
interfaces native to <productname>PostgreSQL</productname> support
|
|
equivalent functionality. Other interfaces might use the large
|
|
object interface internally to provide generic support for large
|
|
values. This is not described here.
|
|
</para>
|
|
|
|
<sect1 id="lo-intro">
|
|
<title>Introduction</title>
|
|
|
|
<indexterm>
|
|
<primary>TOAST</primary>
|
|
<secondary>versus large objects</secondary>
|
|
</indexterm>
|
|
|
|
<para>
|
|
All large objects are stored in a single system table named <link
|
|
linkend="catalog-pg-largeobject"><structname>pg_largeobject</structname></link>.
|
|
Each large object also has an entry in the system table <link
|
|
linkend="catalog-pg-largeobject-metadata"><structname>pg_largeobject_metadata</structname></link>.
|
|
Large objects can be created, modified, and deleted using a read/write API
|
|
that is similar to standard operations on files.
|
|
</para>
|
|
|
|
<para>
|
|
<productname>PostgreSQL</productname> also supports a storage system called
|
|
<link
|
|
linkend="storage-toast"><quote><acronym>TOAST</acronym></quote></link>,
|
|
which automatically stores values
|
|
larger than a single database page into a secondary storage area per table.
|
|
This makes the large object facility partially obsolete. One
|
|
remaining advantage of the large object facility is that it allows values
|
|
up to 4 TB in size, whereas <acronym>TOAST</acronym>ed fields can be at
|
|
most 1 GB. Also, reading and updating portions of a large object can be
|
|
done efficiently, while most operations on a <acronym>TOAST</acronym>ed
|
|
field will read or write the whole value as a unit.
|
|
</para>
|
|
|
|
</sect1>
|
|
|
|
<sect1 id="lo-implementation">
|
|
<title>Implementation Features</title>
|
|
|
|
<para>
|
|
The large object implementation breaks large
|
|
objects up into <quote>chunks</quote> and stores the chunks in
|
|
rows in the database. A B-tree index guarantees fast
|
|
searches for the correct chunk number when doing random
|
|
access reads and writes.
|
|
</para>
|
|
|
|
<para>
|
|
The chunks stored for a large object do not have to be contiguous.
|
|
For example, if an application opens a new large object, seeks to offset
|
|
1000000, and writes a few bytes there, this does not result in allocation
|
|
of 1000000 bytes worth of storage; only of chunks covering the range of
|
|
data bytes actually written. A read operation will, however, read out
|
|
zeroes for any unallocated locations preceding the last existing chunk.
|
|
This corresponds to the common behavior of <quote>sparsely allocated</quote>
|
|
files in <acronym>Unix</acronym> file systems.
|
|
</para>
|
|
|
|
<para>
|
|
As of <productname>PostgreSQL</productname> 9.0, large objects have an owner
|
|
and a set of access permissions, which can be managed using
|
|
<xref linkend="sql-grant"/> and
|
|
<xref linkend="sql-revoke"/>.
|
|
<literal>SELECT</literal> privileges are required to read a large
|
|
object, and
|
|
<literal>UPDATE</literal> privileges are required to write or
|
|
truncate it.
|
|
Only the large object's owner (or a database superuser) can delete,
|
|
comment on, or change the owner of a large object.
|
|
To adjust this behavior for compatibility with prior releases, see the
|
|
<xref linkend="guc-lo-compat-privileges"/> run-time parameter.
|
|
</para>
|
|
</sect1>
|
|
|
|
<sect1 id="lo-interfaces">
|
|
<title>Client Interfaces</title>
|
|
|
|
<para>
|
|
This section describes the facilities that
|
|
<productname>PostgreSQL</productname>'s <application>libpq</application>
|
|
client interface library provides for accessing large objects.
|
|
The <productname>PostgreSQL</productname> large object interface is
|
|
modeled after the <acronym>Unix</acronym> file-system interface, with
|
|
analogues of <function>open</function>, <function>read</function>,
|
|
<function>write</function>,
|
|
<function>lseek</function>, etc.
|
|
</para>
|
|
|
|
<para>
|
|
All large object manipulation using these functions
|
|
<emphasis>must</emphasis> take place within an SQL transaction block,
|
|
since large object file descriptors are only valid for the duration of
|
|
a transaction. Write operations, including <function>lo_open</function>
|
|
with the <symbol>INV_WRITE</symbol> mode, are not allowed in a read-only
|
|
transaction.
|
|
</para>
|
|
|
|
<para>
|
|
If an error occurs while executing any one of these functions, the
|
|
function will return an otherwise-impossible value, typically 0 or -1.
|
|
A message describing the error is stored in the connection object and
|
|
can be retrieved with <xref linkend="libpq-PQerrorMessage"/>.
|
|
</para>
|
|
|
|
<para>
|
|
Client applications that use these functions should include the header file
|
|
<filename>libpq/libpq-fs.h</filename> and link with the
|
|
<application>libpq</application> library.
|
|
</para>
|
|
|
|
<para>
|
|
Client applications cannot use these functions while a libpq connection is in pipeline mode.
|
|
</para>
|
|
|
|
<sect2 id="lo-create">
|
|
<title>Creating a Large Object</title>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_create</primary></indexterm>
|
|
The function
|
|
<synopsis>
|
|
Oid lo_create(PGconn *conn, Oid lobjId);
|
|
</synopsis>
|
|
creates a new large object. The OID to be assigned can be
|
|
specified by <replaceable class="parameter">lobjId</replaceable>;
|
|
if so, failure occurs if that OID is already in use for some large
|
|
object. If <replaceable class="parameter">lobjId</replaceable>
|
|
is <symbol>InvalidOid</symbol> (zero) then <function>lo_create</function>
|
|
assigns an unused OID.
|
|
The return value is the OID that was assigned to the new large object,
|
|
or <symbol>InvalidOid</symbol> (zero) on failure.
|
|
</para>
|
|
|
|
<para>
|
|
An example:
|
|
<programlisting>
|
|
inv_oid = lo_create(conn, desired_oid);
|
|
</programlisting>
|
|
</para>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_creat</primary></indexterm>
|
|
The older function
|
|
<synopsis>
|
|
Oid lo_creat(PGconn *conn, int mode);
|
|
</synopsis>
|
|
also creates a new large object, always assigning an unused OID.
|
|
The return value is the OID that was assigned to the new large object,
|
|
or <symbol>InvalidOid</symbol> (zero) on failure.
|
|
</para>
|
|
|
|
<para>
|
|
In <productname>PostgreSQL</productname> releases 8.1 and later,
|
|
the <replaceable class="parameter">mode</replaceable> is ignored,
|
|
so that <function>lo_creat</function> is exactly equivalent to
|
|
<function>lo_create</function> with a zero second argument.
|
|
However, there is little reason to use <function>lo_creat</function>
|
|
unless you need to work with servers older than 8.1.
|
|
To work with such an old server, you must
|
|
use <function>lo_creat</function> not <function>lo_create</function>,
|
|
and you must set <replaceable class="parameter">mode</replaceable> to
|
|
one of <symbol>INV_READ</symbol>, <symbol>INV_WRITE</symbol>,
|
|
or <symbol>INV_READ</symbol> <literal>|</literal> <symbol>INV_WRITE</symbol>.
|
|
(These symbolic constants are defined
|
|
in the header file <filename>libpq/libpq-fs.h</filename>.)
|
|
</para>
|
|
|
|
<para>
|
|
An example:
|
|
<programlisting>
|
|
inv_oid = lo_creat(conn, INV_READ|INV_WRITE);
|
|
</programlisting>
|
|
</para>
|
|
</sect2>
|
|
|
|
<sect2 id="lo-import">
|
|
<title>Importing a Large Object</title>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_import</primary></indexterm>
|
|
To import an operating system file as a large object, call
|
|
<synopsis>
|
|
Oid lo_import(PGconn *conn, const char *filename);
|
|
</synopsis>
|
|
<replaceable class="parameter">filename</replaceable>
|
|
specifies the operating system name of
|
|
the file to be imported as a large object.
|
|
The return value is the OID that was assigned to the new large object,
|
|
or <symbol>InvalidOid</symbol> (zero) on failure.
|
|
Note that the file is read by the client interface library, not by
|
|
the server; so it must exist in the client file system and be readable
|
|
by the client application.
|
|
</para>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_import_with_oid</primary></indexterm>
|
|
The function
|
|
<synopsis>
|
|
Oid lo_import_with_oid(PGconn *conn, const char *filename, Oid lobjId);
|
|
</synopsis>
|
|
also imports a new large object. The OID to be assigned can be
|
|
specified by <replaceable class="parameter">lobjId</replaceable>;
|
|
if so, failure occurs if that OID is already in use for some large
|
|
object. If <replaceable class="parameter">lobjId</replaceable>
|
|
is <symbol>InvalidOid</symbol> (zero) then <function>lo_import_with_oid</function> assigns an unused
|
|
OID (this is the same behavior as <function>lo_import</function>).
|
|
The return value is the OID that was assigned to the new large object,
|
|
or <symbol>InvalidOid</symbol> (zero) on failure.
|
|
</para>
|
|
|
|
<para>
|
|
<function>lo_import_with_oid</function> is new as of <productname>PostgreSQL</productname>
|
|
8.4 and uses <function>lo_create</function> internally which is new in 8.1; if this function is run against 8.0 or before, it will
|
|
fail and return <symbol>InvalidOid</symbol>.
|
|
</para>
|
|
</sect2>
|
|
|
|
<sect2 id="lo-export">
|
|
<title>Exporting a Large Object</title>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_export</primary></indexterm>
|
|
To export a large object
|
|
into an operating system file, call
|
|
<synopsis>
|
|
int lo_export(PGconn *conn, Oid lobjId, const char *filename);
|
|
</synopsis>
|
|
The <parameter>lobjId</parameter> argument specifies the OID of the large
|
|
object to export and the <parameter>filename</parameter> argument
|
|
specifies the operating system name of the file. Note that the file is
|
|
written by the client interface library, not by the server. Returns 1
|
|
on success, -1 on failure.
|
|
</para>
|
|
</sect2>
|
|
|
|
<sect2 id="lo-open">
|
|
<title>Opening an Existing Large Object</title>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_open</primary></indexterm>
|
|
To open an existing large object for reading or writing, call
|
|
<synopsis>
|
|
int lo_open(PGconn *conn, Oid lobjId, int mode);
|
|
</synopsis>
|
|
The <parameter>lobjId</parameter> argument specifies the OID of the large
|
|
object to open. The <parameter>mode</parameter> bits control whether the
|
|
object is opened for reading (<symbol>INV_READ</symbol>), writing
|
|
(<symbol>INV_WRITE</symbol>), or both.
|
|
(These symbolic constants are defined
|
|
in the header file <filename>libpq/libpq-fs.h</filename>.)
|
|
<function>lo_open</function> returns a (non-negative) large object
|
|
descriptor for later use in <function>lo_read</function>,
|
|
<function>lo_write</function>, <function>lo_lseek</function>,
|
|
<function>lo_lseek64</function>, <function>lo_tell</function>,
|
|
<function>lo_tell64</function>, <function>lo_truncate</function>,
|
|
<function>lo_truncate64</function>, and <function>lo_close</function>.
|
|
The descriptor is only valid for
|
|
the duration of the current transaction.
|
|
On failure, -1 is returned.
|
|
</para>
|
|
|
|
<para>
|
|
The server currently does not distinguish between modes
|
|
<symbol>INV_WRITE</symbol> and <symbol>INV_READ</symbol> <literal>|</literal>
|
|
<symbol>INV_WRITE</symbol>: you are allowed to read from the descriptor
|
|
in either case. However there is a significant difference between
|
|
these modes and <symbol>INV_READ</symbol> alone: with <symbol>INV_READ</symbol>
|
|
you cannot write on the descriptor, and the data read from it will
|
|
reflect the contents of the large object at the time of the transaction
|
|
snapshot that was active when <function>lo_open</function> was executed,
|
|
regardless of later writes by this or other transactions. Reading
|
|
from a descriptor opened with <symbol>INV_WRITE</symbol> returns
|
|
data that reflects all writes of other committed transactions as well
|
|
as writes of the current transaction. This is similar to the behavior
|
|
of <literal>REPEATABLE READ</literal> versus <literal>READ COMMITTED</literal> transaction
|
|
modes for ordinary SQL <command>SELECT</command> commands.
|
|
</para>
|
|
|
|
<para>
|
|
<function>lo_open</function> will fail if <literal>SELECT</literal>
|
|
privilege is not available for the large object, or
|
|
if <symbol>INV_WRITE</symbol> is specified and <literal>UPDATE</literal>
|
|
privilege is not available.
|
|
(Prior to <productname>PostgreSQL</productname> 11, these privilege
|
|
checks were instead performed at the first actual read or write call
|
|
using the descriptor.)
|
|
These privilege checks can be disabled with the
|
|
<xref linkend="guc-lo-compat-privileges"/> run-time parameter.
|
|
</para>
|
|
|
|
<para>
|
|
An example:
|
|
<programlisting>
|
|
inv_fd = lo_open(conn, inv_oid, INV_READ|INV_WRITE);
|
|
</programlisting>
|
|
</para>
|
|
</sect2>
|
|
|
|
<sect2 id="lo-write">
|
|
<title>Writing Data to a Large Object</title>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_write</primary></indexterm>
|
|
The function
|
|
<synopsis>
|
|
int lo_write(PGconn *conn, int fd, const char *buf, size_t len);
|
|
</synopsis>
|
|
writes <parameter>len</parameter> bytes from <parameter>buf</parameter>
|
|
(which must be of size <parameter>len</parameter>) to large object
|
|
descriptor <parameter>fd</parameter>. The <parameter>fd</parameter> argument must
|
|
have been returned by a previous <function>lo_open</function>. The
|
|
number of bytes actually written is returned (in the current
|
|
implementation, this will always equal <parameter>len</parameter> unless
|
|
there is an error). In the event of an error, the return value is -1.
|
|
</para>
|
|
|
|
<para>
|
|
Although the <parameter>len</parameter> parameter is declared as
|
|
<type>size_t</type>, this function will reject length values larger than
|
|
<literal>INT_MAX</literal>. In practice, it's best to transfer data in chunks
|
|
of at most a few megabytes anyway.
|
|
</para>
|
|
</sect2>
|
|
|
|
<sect2 id="lo-read">
|
|
<title>Reading Data from a Large Object</title>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_read</primary></indexterm>
|
|
The function
|
|
<synopsis>
|
|
int lo_read(PGconn *conn, int fd, char *buf, size_t len);
|
|
</synopsis>
|
|
reads up to <parameter>len</parameter> bytes from large object descriptor
|
|
<parameter>fd</parameter> into <parameter>buf</parameter> (which must be
|
|
of size <parameter>len</parameter>). The <parameter>fd</parameter>
|
|
argument must have been returned by a previous
|
|
<function>lo_open</function>. The number of bytes actually read is
|
|
returned; this will be less than <parameter>len</parameter> if the end of
|
|
the large object is reached first. In the event of an error, the return
|
|
value is -1.
|
|
</para>
|
|
|
|
<para>
|
|
Although the <parameter>len</parameter> parameter is declared as
|
|
<type>size_t</type>, this function will reject length values larger than
|
|
<literal>INT_MAX</literal>. In practice, it's best to transfer data in chunks
|
|
of at most a few megabytes anyway.
|
|
</para>
|
|
</sect2>
|
|
|
|
<sect2 id="lo-seek">
|
|
<title>Seeking in a Large Object</title>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_lseek</primary></indexterm>
|
|
To change the current read or write location associated with a
|
|
large object descriptor, call
|
|
<synopsis>
|
|
int lo_lseek(PGconn *conn, int fd, int offset, int whence);
|
|
</synopsis>
|
|
This function moves the
|
|
current location pointer for the large object descriptor identified by
|
|
<parameter>fd</parameter> to the new location specified by
|
|
<parameter>offset</parameter>. The valid values for <parameter>whence</parameter>
|
|
are <symbol>SEEK_SET</symbol> (seek from object start),
|
|
<symbol>SEEK_CUR</symbol> (seek from current position), and
|
|
<symbol>SEEK_END</symbol> (seek from object end). The return value is
|
|
the new location pointer, or -1 on error.
|
|
</para>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_lseek64</primary></indexterm>
|
|
When dealing with large objects that might exceed 2GB in size,
|
|
instead use
|
|
<synopsis>
|
|
pg_int64 lo_lseek64(PGconn *conn, int fd, pg_int64 offset, int whence);
|
|
</synopsis>
|
|
This function has the same behavior
|
|
as <function>lo_lseek</function>, but it can accept an
|
|
<parameter>offset</parameter> larger than 2GB and/or deliver a result larger
|
|
than 2GB.
|
|
Note that <function>lo_lseek</function> will fail if the new location
|
|
pointer would be greater than 2GB.
|
|
</para>
|
|
|
|
<para>
|
|
<function>lo_lseek64</function> is new as of <productname>PostgreSQL</productname>
|
|
9.3. If this function is run against an older server version, it will
|
|
fail and return -1.
|
|
</para>
|
|
|
|
</sect2>
|
|
|
|
<sect2 id="lo-tell">
|
|
<title>Obtaining the Seek Position of a Large Object</title>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_tell</primary></indexterm>
|
|
To obtain the current read or write location of a large object descriptor,
|
|
call
|
|
<synopsis>
|
|
int lo_tell(PGconn *conn, int fd);
|
|
</synopsis>
|
|
If there is an error, the return value is -1.
|
|
</para>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_tell64</primary></indexterm>
|
|
When dealing with large objects that might exceed 2GB in size,
|
|
instead use
|
|
<synopsis>
|
|
pg_int64 lo_tell64(PGconn *conn, int fd);
|
|
</synopsis>
|
|
This function has the same behavior
|
|
as <function>lo_tell</function>, but it can deliver a result larger
|
|
than 2GB.
|
|
Note that <function>lo_tell</function> will fail if the current
|
|
read/write location is greater than 2GB.
|
|
</para>
|
|
|
|
<para>
|
|
<function>lo_tell64</function> is new as of <productname>PostgreSQL</productname>
|
|
9.3. If this function is run against an older server version, it will
|
|
fail and return -1.
|
|
</para>
|
|
</sect2>
|
|
|
|
<sect2 id="lo-truncate">
|
|
<title>Truncating a Large Object</title>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_truncate</primary></indexterm>
|
|
To truncate a large object to a given length, call
|
|
<synopsis>
|
|
int lo_truncate(PGconn *conn, int fd, size_t len);
|
|
</synopsis>
|
|
This function truncates the large object
|
|
descriptor <parameter>fd</parameter> to length <parameter>len</parameter>. The
|
|
<parameter>fd</parameter> argument must have been returned by a
|
|
previous <function>lo_open</function>. If <parameter>len</parameter> is
|
|
greater than the large object's current length, the large object
|
|
is extended to the specified length with null bytes ('\0').
|
|
On success, <function>lo_truncate</function> returns
|
|
zero. On error, the return value is -1.
|
|
</para>
|
|
|
|
<para>
|
|
The read/write location associated with the descriptor
|
|
<parameter>fd</parameter> is not changed.
|
|
</para>
|
|
|
|
<para>
|
|
Although the <parameter>len</parameter> parameter is declared as
|
|
<type>size_t</type>, <function>lo_truncate</function> will reject length
|
|
values larger than <literal>INT_MAX</literal>.
|
|
</para>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_truncate64</primary></indexterm>
|
|
When dealing with large objects that might exceed 2GB in size,
|
|
instead use
|
|
<synopsis>
|
|
int lo_truncate64(PGconn *conn, int fd, pg_int64 len);
|
|
</synopsis>
|
|
This function has the same
|
|
behavior as <function>lo_truncate</function>, but it can accept a
|
|
<parameter>len</parameter> value exceeding 2GB.
|
|
</para>
|
|
|
|
<para>
|
|
<function>lo_truncate</function> is new as of <productname>PostgreSQL</productname>
|
|
8.3; if this function is run against an older server version, it will
|
|
fail and return -1.
|
|
</para>
|
|
|
|
<para>
|
|
<function>lo_truncate64</function> is new as of <productname>PostgreSQL</productname>
|
|
9.3; if this function is run against an older server version, it will
|
|
fail and return -1.
|
|
</para>
|
|
</sect2>
|
|
|
|
<sect2 id="lo-close">
|
|
<title>Closing a Large Object Descriptor</title>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_close</primary></indexterm>
|
|
A large object descriptor can be closed by calling
|
|
<synopsis>
|
|
int lo_close(PGconn *conn, int fd);
|
|
</synopsis>
|
|
where <parameter>fd</parameter> is a
|
|
large object descriptor returned by <function>lo_open</function>.
|
|
On success, <function>lo_close</function> returns zero. On
|
|
error, the return value is -1.
|
|
</para>
|
|
|
|
<para>
|
|
Any large object descriptors that remain open at the end of a
|
|
transaction will be closed automatically.
|
|
</para>
|
|
</sect2>
|
|
|
|
<sect2 id="lo-unlink">
|
|
<title>Removing a Large Object</title>
|
|
|
|
<para>
|
|
<indexterm><primary>lo_unlink</primary></indexterm>
|
|
To remove a large object from the database, call
|
|
<synopsis>
|
|
int lo_unlink(PGconn *conn, Oid lobjId);
|
|
</synopsis>
|
|
The <parameter>lobjId</parameter> argument specifies the OID of the
|
|
large object to remove. Returns 1 if successful, -1 on failure.
|
|
</para>
|
|
</sect2>
|
|
|
|
</sect1>
|
|
|
|
<sect1 id="lo-funcs">
|
|
<title>Server-Side Functions</title>
|
|
|
|
<para>
|
|
Server-side functions tailored for manipulating large objects from SQL are
|
|
listed in <xref linkend="lo-funcs-table"/>.
|
|
</para>
|
|
|
|
<table id="lo-funcs-table">
|
|
<title>SQL-Oriented Large Object Functions</title>
|
|
<tgroup cols="1">
|
|
<thead>
|
|
<row>
|
|
<entry role="func_table_entry"><para role="func_signature">
|
|
Function
|
|
</para>
|
|
<para>
|
|
Description
|
|
</para>
|
|
<para>
|
|
Example(s)
|
|
</para></entry>
|
|
</row>
|
|
</thead>
|
|
|
|
<tbody>
|
|
<row>
|
|
<entry role="func_table_entry"><para role="func_signature">
|
|
<indexterm>
|
|
<primary>lo_from_bytea</primary>
|
|
</indexterm>
|
|
<function>lo_from_bytea</function> ( <parameter>loid</parameter> <type>oid</type>, <parameter>data</parameter> <type>bytea</type> )
|
|
<returnvalue>oid</returnvalue>
|
|
</para>
|
|
<para>
|
|
Creates a large object and stores <parameter>data</parameter> in it.
|
|
If <parameter>loid</parameter> is zero then the system will choose a
|
|
free OID, otherwise that OID is used (with an error if some large
|
|
object already has that OID). On success, the large object's OID is
|
|
returned.
|
|
</para>
|
|
<para>
|
|
<literal>lo_from_bytea(0, '\xffffff00')</literal>
|
|
<returnvalue>24528</returnvalue>
|
|
</para></entry>
|
|
</row>
|
|
|
|
<row>
|
|
<entry role="func_table_entry"><para role="func_signature">
|
|
<indexterm>
|
|
<primary>lo_put</primary>
|
|
</indexterm>
|
|
<function>lo_put</function> ( <parameter>loid</parameter> <type>oid</type>, <parameter>offset</parameter> <type>bigint</type>, <parameter>data</parameter> <type>bytea</type> )
|
|
<returnvalue>void</returnvalue>
|
|
</para>
|
|
<para>
|
|
Writes <parameter>data</parameter> starting at the given offset within
|
|
the large object; the large object is enlarged if necessary.
|
|
</para>
|
|
<para>
|
|
<literal>lo_put(24528, 1, '\xaa')</literal>
|
|
<returnvalue></returnvalue>
|
|
</para></entry>
|
|
</row>
|
|
|
|
<row>
|
|
<entry role="func_table_entry"><para role="func_signature">
|
|
<indexterm>
|
|
<primary>lo_get</primary>
|
|
</indexterm>
|
|
<function>lo_get</function> ( <parameter>loid</parameter> <type>oid</type> <optional>, <parameter>offset</parameter> <type>bigint</type>, <parameter>length</parameter> <type>integer</type> </optional> )
|
|
<returnvalue>bytea</returnvalue>
|
|
</para>
|
|
<para>
|
|
Extracts the large object's contents, or a substring thereof.
|
|
</para>
|
|
<para>
|
|
<literal>lo_get(24528, 0, 3)</literal>
|
|
<returnvalue>\xffaaff</returnvalue>
|
|
</para></entry>
|
|
</row>
|
|
</tbody>
|
|
</tgroup>
|
|
</table>
|
|
|
|
<para>
|
|
There are additional server-side functions corresponding to each of the
|
|
client-side functions described earlier; indeed, for the most part the
|
|
client-side functions are simply interfaces to the equivalent server-side
|
|
functions. The ones just as convenient to call via SQL commands are
|
|
<function>lo_creat</function><indexterm><primary>lo_creat</primary></indexterm>,
|
|
<function>lo_create</function>,
|
|
<function>lo_unlink</function><indexterm><primary>lo_unlink</primary></indexterm>,
|
|
<function>lo_import</function><indexterm><primary>lo_import</primary></indexterm>, and
|
|
<function>lo_export</function><indexterm><primary>lo_export</primary></indexterm>.
|
|
Here are examples of their use:
|
|
|
|
<programlisting>
|
|
CREATE TABLE image (
|
|
name text,
|
|
raster oid
|
|
);
|
|
|
|
SELECT lo_creat(-1); -- returns OID of new, empty large object
|
|
|
|
SELECT lo_create(43213); -- attempts to create large object with OID 43213
|
|
|
|
SELECT lo_unlink(173454); -- deletes large object with OID 173454
|
|
|
|
INSERT INTO image (name, raster)
|
|
VALUES ('beautiful image', lo_import('/etc/motd'));
|
|
|
|
INSERT INTO image (name, raster) -- same as above, but specify OID to use
|
|
VALUES ('beautiful image', lo_import('/etc/motd', 68583));
|
|
|
|
SELECT lo_export(image.raster, '/tmp/motd') FROM image
|
|
WHERE name = 'beautiful image';
|
|
</programlisting>
|
|
</para>
|
|
|
|
<para>
|
|
The server-side <function>lo_import</function> and
|
|
<function>lo_export</function> functions behave considerably differently
|
|
from their client-side analogs. These two functions read and write files
|
|
in the server's file system, using the permissions of the database's
|
|
owning user. Therefore, by default their use is restricted to superusers.
|
|
In contrast, the client-side import and export functions read and write
|
|
files in the client's file system, using the permissions of the client
|
|
program. The client-side functions do not require any database
|
|
privileges, except the privilege to read or write the large object in
|
|
question.
|
|
</para>
|
|
|
|
<caution>
|
|
<para>
|
|
It is possible to <xref linkend="sql-grant"/> use of the
|
|
server-side <function>lo_import</function>
|
|
and <function>lo_export</function> functions to non-superusers, but
|
|
careful consideration of the security implications is required. A
|
|
malicious user of such privileges could easily parlay them into becoming
|
|
superuser (for example by rewriting server configuration files), or could
|
|
attack the rest of the server's file system without bothering to obtain
|
|
database superuser privileges as such. <emphasis>Access to roles having
|
|
such privilege must therefore be guarded just as carefully as access to
|
|
superuser roles.</emphasis> Nonetheless, if use of
|
|
server-side <function>lo_import</function>
|
|
or <function>lo_export</function> is needed for some routine task, it's
|
|
safer to use a role with such privileges than one with full superuser
|
|
privileges, as that helps to reduce the risk of damage from accidental
|
|
errors.
|
|
</para>
|
|
</caution>
|
|
|
|
<para>
|
|
The functionality of <function>lo_read</function> and
|
|
<function>lo_write</function> is also available via server-side calls,
|
|
but the names of the server-side functions differ from the client side
|
|
interfaces in that they do not contain underscores. You must call
|
|
these functions as <function>loread</function> and <function>lowrite</function>.
|
|
</para>
|
|
|
|
</sect1>
|
|
|
|
<sect1 id="lo-examplesect">
|
|
<title>Example Program</title>
|
|
|
|
<para>
|
|
<xref linkend="lo-example"/> is a sample program which shows how the large object
|
|
interface
|
|
in <application>libpq</application> can be used. Parts of the program are
|
|
commented out but are left in the source for the reader's
|
|
benefit. This program can also be found in
|
|
<filename>src/test/examples/testlo.c</filename> in the source distribution.
|
|
</para>
|
|
|
|
<example id="lo-example">
|
|
<title>Large Objects with <application>libpq</application> Example Program</title>
|
|
<programlisting><![CDATA[
|
|
/*-----------------------------------------------------------------
|
|
*
|
|
* testlo.c
|
|
* test using large objects with libpq
|
|
*
|
|
* Portions Copyright (c) 1996-2022, PostgreSQL Global Development Group
|
|
* Portions Copyright (c) 1994, Regents of the University of California
|
|
*
|
|
*
|
|
* IDENTIFICATION
|
|
* src/test/examples/testlo.c
|
|
*
|
|
*-----------------------------------------------------------------
|
|
*/
|
|
#include <stdio.h>
|
|
#include <stdlib.h>
|
|
|
|
#include <sys/types.h>
|
|
#include <sys/stat.h>
|
|
#include <fcntl.h>
|
|
#include <unistd.h>
|
|
|
|
#include "libpq-fe.h"
|
|
#include "libpq/libpq-fs.h"
|
|
|
|
#define BUFSIZE 1024
|
|
|
|
/*
|
|
* importFile -
|
|
* import file "in_filename" into database as large object "lobjOid"
|
|
*
|
|
*/
|
|
static Oid
|
|
importFile(PGconn *conn, char *filename)
|
|
{
|
|
Oid lobjId;
|
|
int lobj_fd;
|
|
char buf[BUFSIZE];
|
|
int nbytes,
|
|
tmp;
|
|
int fd;
|
|
|
|
/*
|
|
* open the file to be read in
|
|
*/
|
|
fd = open(filename, O_RDONLY, 0666);
|
|
if (fd < 0)
|
|
{ /* error */
|
|
fprintf(stderr, "cannot open unix file\"%s\"\n", filename);
|
|
}
|
|
|
|
/*
|
|
* create the large object
|
|
*/
|
|
lobjId = lo_creat(conn, INV_READ | INV_WRITE);
|
|
if (lobjId == 0)
|
|
fprintf(stderr, "cannot create large object");
|
|
|
|
lobj_fd = lo_open(conn, lobjId, INV_WRITE);
|
|
|
|
/*
|
|
* read in from the Unix file and write to the inversion file
|
|
*/
|
|
while ((nbytes = read(fd, buf, BUFSIZE)) > 0)
|
|
{
|
|
tmp = lo_write(conn, lobj_fd, buf, nbytes);
|
|
if (tmp < nbytes)
|
|
fprintf(stderr, "error while reading \"%s\"", filename);
|
|
}
|
|
|
|
close(fd);
|
|
lo_close(conn, lobj_fd);
|
|
|
|
return lobjId;
|
|
}
|
|
|
|
static void
|
|
pickout(PGconn *conn, Oid lobjId, int start, int len)
|
|
{
|
|
int lobj_fd;
|
|
char *buf;
|
|
int nbytes;
|
|
int nread;
|
|
|
|
lobj_fd = lo_open(conn, lobjId, INV_READ);
|
|
if (lobj_fd < 0)
|
|
fprintf(stderr, "cannot open large object %u", lobjId);
|
|
|
|
lo_lseek(conn, lobj_fd, start, SEEK_SET);
|
|
buf = malloc(len + 1);
|
|
|
|
nread = 0;
|
|
while (len - nread > 0)
|
|
{
|
|
nbytes = lo_read(conn, lobj_fd, buf, len - nread);
|
|
buf[nbytes] = '\0';
|
|
fprintf(stderr, ">>> %s", buf);
|
|
nread += nbytes;
|
|
if (nbytes <= 0)
|
|
break; /* no more data? */
|
|
}
|
|
free(buf);
|
|
fprintf(stderr, "\n");
|
|
lo_close(conn, lobj_fd);
|
|
}
|
|
|
|
static void
|
|
overwrite(PGconn *conn, Oid lobjId, int start, int len)
|
|
{
|
|
int lobj_fd;
|
|
char *buf;
|
|
int nbytes;
|
|
int nwritten;
|
|
int i;
|
|
|
|
lobj_fd = lo_open(conn, lobjId, INV_WRITE);
|
|
if (lobj_fd < 0)
|
|
fprintf(stderr, "cannot open large object %u", lobjId);
|
|
|
|
lo_lseek(conn, lobj_fd, start, SEEK_SET);
|
|
buf = malloc(len + 1);
|
|
|
|
for (i = 0; i < len; i++)
|
|
buf[i] = 'X';
|
|
buf[i] = '\0';
|
|
|
|
nwritten = 0;
|
|
while (len - nwritten > 0)
|
|
{
|
|
nbytes = lo_write(conn, lobj_fd, buf + nwritten, len - nwritten);
|
|
nwritten += nbytes;
|
|
if (nbytes <= 0)
|
|
{
|
|
fprintf(stderr, "\nWRITE FAILED!\n");
|
|
break;
|
|
}
|
|
}
|
|
free(buf);
|
|
fprintf(stderr, "\n");
|
|
lo_close(conn, lobj_fd);
|
|
}
|
|
|
|
|
|
/*
|
|
* exportFile -
|
|
* export large object "lobjOid" to file "out_filename"
|
|
*
|
|
*/
|
|
static void
|
|
exportFile(PGconn *conn, Oid lobjId, char *filename)
|
|
{
|
|
int lobj_fd;
|
|
char buf[BUFSIZE];
|
|
int nbytes,
|
|
tmp;
|
|
int fd;
|
|
|
|
/*
|
|
* open the large object
|
|
*/
|
|
lobj_fd = lo_open(conn, lobjId, INV_READ);
|
|
if (lobj_fd < 0)
|
|
fprintf(stderr, "cannot open large object %u", lobjId);
|
|
|
|
/*
|
|
* open the file to be written to
|
|
*/
|
|
fd = open(filename, O_CREAT | O_WRONLY | O_TRUNC, 0666);
|
|
if (fd < 0)
|
|
{ /* error */
|
|
fprintf(stderr, "cannot open unix file\"%s\"",
|
|
filename);
|
|
}
|
|
|
|
/*
|
|
* read in from the inversion file and write to the Unix file
|
|
*/
|
|
while ((nbytes = lo_read(conn, lobj_fd, buf, BUFSIZE)) > 0)
|
|
{
|
|
tmp = write(fd, buf, nbytes);
|
|
if (tmp < nbytes)
|
|
{
|
|
fprintf(stderr, "error while writing \"%s\"",
|
|
filename);
|
|
}
|
|
}
|
|
|
|
lo_close(conn, lobj_fd);
|
|
close(fd);
|
|
}
|
|
|
|
static void
|
|
exit_nicely(PGconn *conn)
|
|
{
|
|
PQfinish(conn);
|
|
exit(1);
|
|
}
|
|
|
|
int
|
|
main(int argc, char **argv)
|
|
{
|
|
char *in_filename,
|
|
*out_filename;
|
|
char *database;
|
|
Oid lobjOid;
|
|
PGconn *conn;
|
|
PGresult *res;
|
|
|
|
if (argc != 4)
|
|
{
|
|
fprintf(stderr, "Usage: %s database_name in_filename out_filename\n",
|
|
argv[0]);
|
|
exit(1);
|
|
}
|
|
|
|
database = argv[1];
|
|
in_filename = argv[2];
|
|
out_filename = argv[3];
|
|
|
|
/*
|
|
* set up the connection
|
|
*/
|
|
conn = PQsetdb(NULL, NULL, NULL, NULL, database);
|
|
|
|
/* check to see that the backend connection was successfully made */
|
|
if (PQstatus(conn) != CONNECTION_OK)
|
|
{
|
|
fprintf(stderr, "%s", PQerrorMessage(conn));
|
|
exit_nicely(conn);
|
|
}
|
|
|
|
/* Set always-secure search path, so malicious users can't take control. */
|
|
res = PQexec(conn,
|
|
"SELECT pg_catalog.set_config('search_path', '', false)");
|
|
if (PQresultStatus(res) != PGRES_TUPLES_OK)
|
|
{
|
|
fprintf(stderr, "SET failed: %s", PQerrorMessage(conn));
|
|
PQclear(res);
|
|
exit_nicely(conn);
|
|
}
|
|
PQclear(res);
|
|
|
|
res = PQexec(conn, "begin");
|
|
PQclear(res);
|
|
printf("importing file \"%s\" ...\n", in_filename);
|
|
/* lobjOid = importFile(conn, in_filename); */
|
|
lobjOid = lo_import(conn, in_filename);
|
|
if (lobjOid == 0)
|
|
fprintf(stderr, "%s\n", PQerrorMessage(conn));
|
|
else
|
|
{
|
|
printf("\tas large object %u.\n", lobjOid);
|
|
|
|
printf("picking out bytes 1000-2000 of the large object\n");
|
|
pickout(conn, lobjOid, 1000, 1000);
|
|
|
|
printf("overwriting bytes 1000-2000 of the large object with X's\n");
|
|
overwrite(conn, lobjOid, 1000, 1000);
|
|
|
|
printf("exporting large object to file \"%s\" ...\n", out_filename);
|
|
/* exportFile(conn, lobjOid, out_filename); */
|
|
if (lo_export(conn, lobjOid, out_filename) < 0)
|
|
fprintf(stderr, "%s\n", PQerrorMessage(conn));
|
|
}
|
|
|
|
res = PQexec(conn, "end");
|
|
PQclear(res);
|
|
PQfinish(conn);
|
|
return 0;
|
|
}
|
|
]]>
|
|
</programlisting>
|
|
</example>
|
|
|
|
</sect1>
|
|
</chapter>
|