package torch
Install
Dune Dependency
Authors
Maintainers
Sources
md5=c64a4ae1c677f9e2c2528bec6b70e733
sha512=2617754f8e5d127758ce66519683886d660ea9747659d6a3d58d0eb2e299ba9e3da7e5a559dc784bf36cdad477c59394f6ff93657378c572e17596bde52f4537
doc/torch.vision/Torch_vision/Stb_image/index.html
Module Torch_vision.Stb_image
Source
Image representation
type 'kind buffer = ('a, 'b, Bigarray.c_layout) Bigarray.Array1.t constraint 'kind = ('a, 'b) Bigarray.kind
buffer
simply is an alias to a bigarray with c_layout. The buffer
type serves two purposes:
- representing input files,
- representing the raw pixels of an image.
Two kind of pixel buffers are manipulated:
- int8 for images with 8-bit channels
- float32 for images with floating point channels
type 'kind t = private {
width : int;
height : int;
channels : int;
offset : int;
stride : int;
data : 'kind buffer;
}
A record describing an image. The buffer contains channels * width * height
items, in this order:
- channels are interleaved
- each pixel is made of
channels
items - each line is made of
width
pixels - image is made of
height
lines
Creating image
Image accessors
Image decoding
Load an 8-bit per channel image from a filename. If channels
is specified, it has to be between 1 and 4 and the decoded image will be processed to have the requested number of channels.
Load a floating point channel image from a filename. See load
for channels
parameter.
Decode an 8-bit per channel image from a buffer. See load
for channels
parameter.
Decode a floating point channel image from a buffer. See load
for channels
parameter.
Low-level interface
Functions are similar to the above one, except memory is not managed by OCaml GC. It has to be released explicitly with free_unmanaged
function.
You get slightly faster load times, more deterministic memory use and more responsibility. Use at your own risk!
Image filtering
Generate one level of mipmap: downsample image half in each dimension. In mipmap imgin imgout
:
- imgout.channels must be imgin.channels
- imgout.width must be imgin.width / 2
- imgout.height must be imgin.height / 2
- imgout.data will be filled with downsampled imgin.data