package orsetto
Install
Dune Dependency
Authors
Maintainers
Sources
sha256=151ca6df499bd3de7aa89a4e1627411fbee24c4dea6e0e71ce21f06f181ee654
md5=00393728b481c2bf15919a8202732335
doc/orsetto.json/Json_scan/index.html
Module Json_scan
JavaScript Object Notation (JSON) input scanner.
Overview
Functions that scan Unicode text for JSON interchange language syntax. A low-level lexical token scanner is provided for monadic parsing of any valid JSON text, including infinite streams. Higher-level scanners are provided for composing bespoke parsers according to a required schema. A scanner is also provided for input of arbitrary finite JSON values.
Lexical Analyzer
module Annot = Ucs_scan.UTF8.Annot
A convenient alias for the UTF-8 text annotation system.
val lexer : Json_event.t Annot.form Ucs_scan.UTF8.t
The lexical analyzer that recognizes JSON events.
Parser
Include the scanner profile. The first stage scanner is the lexical analyzer augmented to raise Bad_syntax
on failure to recognize a valid event.
include Cf_scan.Staging.Profile
with type symbol := Uchar.t
and type token := Json_event.t
and type position := Cf_annot.Textual.position
and type 'a form := 'a Annot.form
include Cf_scan.Profile
with type symbol := Json_event.t
with type position := Cf_annot.Textual.position
with type 'a form := 'a Annot.form
include Cf_monad.Unary.Profile with type +'r t := 'r t
Module inclusions from Cf_monad_core
and Cf_seqmonad
.
include Cf_monad.Core.Unary.Profile with type 'r t := 'r t
val return : 'r -> 'r t
Use return a
to apply the binding to a
.
Use map m f
to return the result of applying f
to the value returned by m
.
module Infix : Cf_monad_core.Unary.Infix with type 'r t := 'r t
Open Infix
to include the infix monad operators.
include Cf_seqmonad.Functor.Unary with type 'r t := 'r t
Use collect s
to bind in sequence every monad value in the finite sequence s
and collect all the returned values. Returns (n, s)
where n
is the number of values collected and s
is the list of values in reverse order, i.e. from last collected to first collected. Never returns and exhausts all memory if s
never terminates.
val nil : 'r t
A scanner that never produces any value.
val fin : bool t
A scanner that returns true
at the end of the input sequence, otherwise returns false
.
Backtracking
val pos : mark -> unit Annot.form
Use pos mark
to make a unit value attributed with the position of captured mark
. Raises Not_found
if mark
was captured at the end of the input stream.
Terminal Scanner
val any : Json_event.t Annot.form t
The universal symbol scanner. Recognizes any symbol in the input stream and produces its form. Does not produce anything at the end of input.
val one : Json_event.t -> Json_event.t Annot.form t
The literal symbol scanner. Use one symbol
to make a scanner that recognizes symbol
in the input stream and produces its form.
val sat : (Json_event.t -> bool) -> Json_event.t Annot.form t
The symbol satisfier scanner. Use sat f
to make a scanner that recognizes any symbol for which applying f
returns true
and produces its form.
val ign : (Json_event.t -> bool) -> unit Annot.form t
The ignore scanner. Use ign f
to make a scanner that scans the input while applying f
to each symbol returns true
, then produces a unit form that annotates the span of ignored symbols. Produces an implicit unit form if the end of input has already been reached.
val tok : (Json_event.t -> 'r option) -> 'r Annot.form t
The symbolic token scanner. Use tok f
to make a scanner that recognizes any symbol for which applying f
returns Some v
, then produces the form of v
.
Scanner Composers
val ntyp : 'r Cf_type.nym -> 'r Annot.form t -> Cf_type.opaque Annot.form t
The opaque value form scanner composer. Use ntyp n p
to make a scanner that encloses the value contained in the form produced by p
in an opaque value with the runtime type indicated by n
and returns its form in the same position.
The default value scanner. Use dflt v p
to produce the output of p
or the default value v
if p
does not produce output.
The optional scanner composer. Use opt p
to make a scanner that produces either Some v
if p
produces v
otherwise None
.
The visitor scanner composer. Use vis ?a ?b f v
to compose a scanner that recognizes a sequence of elements in the input stream by applying a visitor function f
at each element to obtain its scanner. The first element is visited with the initializer v
, and each following element is visited with the value returned by the preceding scanner.
If ~a
is used, then it specifies the minimum number of elements to visit. If ~b
is used then it specifies the maximum number of elements to visit. Composition raises Invalid_argument
if a < 0
or b < a
.
The homogenous list scanner composer. Use seq ?a ?b p
to create a new scanner that uses p
to recognize and produce, in order, each element in a sequence of elements in the input stream.
If ~a
is used, then it specifies the minimum number of elements that must be recognized and produced in the output. If ~b
is used then it specifies the maximum number of elements to recognize. Composition raises Invalid_argument
if a < 0
or b < a
.
The bounded multiple choice scanner. Use alt ps
to create a scanner that produces the output from the first scanner ps
that produces. If no scanner in ps
produces output, then the resulting scanner does not produce.
The unbounded multiple choice scanner. Use alt ps
to create a scanner that produces the output from the first scanner ps
that produces. If no scanner in ps
produces output, then the resulting scanner does not produce.
Error Parsers
A distinguished syntax failure exception.
val fail : string -> 'r t
Use fail msg
to raise Bad_syntax
with msg
optionally annotated with the current position.
Use or_fail msg p
to make a scanner that raises Bad_syntax
with msg
if p
does not recognize its input. It may be convenient to call this function with a pipeline operator, i.e. p |> or_fail "reasons"
.
val err : ?x:exn -> unit -> 'r t
Use err ~x ()
to make a scanner that raises x
. If ?x
is not provided, then it raises Not_found
.
Use errf ~xf ()
to make a scanner that captures a mark and applies it to xf
to raise an exception. If ?xf
is not provided, then raises Not_found
.
Use req ~x p
to make a scanner that either produces the output of p
or raises x
. If p
does not produce and ?x
is not provided, then it raises Not_found
.
Use reqf ~xf p
to make a scanner that either produces the output of p
or captures a mark at the current input and applies it to xf
to raise an exception. If ?xf
is not provided, then raises Not_found
.
The error check scanner. Use ck p
to create a new scanner that either produces either Ok v
if p
produces v
or Error x
if scanning the input with p
raises the exception x
.
The error recovery scanner. Use sync p
to scan the input with p
until it produces or reaches the end of input. Produces Some v
if p
ever produces v
, otherwise produces None
if the end of input is reached without p
producing a value.
Elaboration
val lift :
?start:Cf_annot.Textual.position ->
'r t ->
Json_event.t Seq.t ->
'r Seq.t
Use lift p s
to map s
into a persistent sequence of the values produced by p
. If ~start
is provided, then it specifies the starting position of the first symbol in s
.
val of_seq :
?start:Cf_annot.Textual.position ->
'r t ->
Json_event.t Seq.t ->
'r
Use of_seq p s
to parse s
with p
and return the result. Raises Not_found
if p
does not recognize the entire sequence of s
. If ~start
is provided, then it specifies the starting position of the first symbol in s
.
module Affix : sig ... end
Combinator operators
val lift_staged :
?start:Cf_annot.Textual.position ->
'r t ->
Uchar.t Seq.t ->
'r Seq.t
Use lift_staged p s
to lift s
into the first stage token sequence, then map it into a persistent sequence of the values produced by p
. If ~start
is provided, then it specifies the starting position of the first symbol in s
.
val of_seq_staged :
?start:Cf_annot.Textual.position ->
'r t ->
Uchar.t Seq.t ->
'r
Use of_seq_staged p s
to lift s
into the first stage token sequence, then parse it with p
, and return the result. Raises Not_found
if p
does not recognize the entire sequence of s
. If ~start
is provided, then it specifies the starting position of the first symbol in s
.
val event : Json_event.t Annot.form t
The parser that returns the next annotated event that is not space.
val space : unit Annot.form t
The parser that recognizes a space event.
Scalar Values
The following parsers all ignore any white space encountered in the input before recognizing the scalar value.
val null : unit Annot.form t
A parser that recognizes the null literal.
val boolean : bool Annot.form t
The parser that recognizes either the true or false literal and returns the value of annotated boolean value accordingly.
val integer : int Annot.form t
The parser that recognizes an integer number within the range of Ocaml and returns the annotated integer value accordingly.
val float : float Annot.form t
The parser that recognizes a number within the range of Ocaml floating point numbers and returns the annotated float value accordingly.
val string : Ucs_text.t Annot.form t
The parser that recognizes a string literal and returns the annotated Unicode text with all the escaped sequences converted.
val signal : [< Json_event.signal ] -> unit Annot.form t
Use signal x
to make a parser that recognizes the event signal x
.
Aggregate Values
The following interfaces are provided to facilitate various strategies for scanning arrays and objects.
module Chain :
Cf_chain_scan.Profile
with type symbol := Json_event.t Annot.form
and type mark := mark
and type 'a t := 'a t
and type 'a form := 'a
The chain scanner module.
val chain : ?xf:(mark -> 'x) -> unit -> Chain.chain
Use chain ()
to make a chain discipline for array and object content, which is a list of elements separated by semi-colon characters.
module Object :
Cf_record_scan.Profile
with type index := Ucs_text.t
and type 'a form := 'a Annot.form
and type 'a t := 'a t
The record scanner module for JSON object content.
val group : [< `Array | `Object ] -> 'r t -> 'r Annot.form t
Use group kind content
to make a parser that recognizes an array or an object form according to kind
with content that must be recognized by content
.
For example, use float |> seq ~a:1 ?b |> group `Array
to scan an array of one or more floating point numbers.
Alternatively, use Object.scan schema |> group `Object
to scan an object according to schema
.
val value : Cf_type.opaque Annot.form t
A parser that recognizes any JSON encoded value and produces an annotated opaque value with encapsulated runtime type indication. The following table describes the runtime type indications produced for values decoded.
- null:
Cf_type.Unit
- boolean:
Cf_type.Bool
- number:
Cf_type.Int
orCf_type.Float
- string:
Ucs_type.Text
- array:
Cf_type.Seq Cf_type.Opaque
- object:
Cf_type.(Seq (Pair (Ucs_type.Text, Cf_type.Opaque))
Use the Ucs_type
module for unpacking.
Note well: the array and object containers are sequences of untyped values, which must be unpacked recursively. Exceeding a maximum recursion depth of 1000 raises Bad_syntax
.
Conveniences
val of_text : 'a t -> Ucs_text.t -> 'a
Use of_text p t
to parse the Unicode text s
with p
.