package fmlib_parse
Parsing with combinators and indentation sensitivity
Sources
0.5.11.tar.gz
sha256=987144e79a5ab8544a9cac669284ef7610a70c3362d4f55e5d27e4f33b49a1b9
Module Fmlib_parse.Ucharacter
Parser for streams of unicode characters.
There are several possibilities to encode unicode characters in byte streams.
- utf8: Encodes a unicode character in 1 to 4 bytes. The ascii characters are included as a special case (one byte each). Mostly used to transfer unicode text data on the internet and on Unix-based platforms (like macOS).
- utf16: Encodes a unicode character in 2 or 4 bytes. The whole basic multilingual plane is encoded in 2 bytes and all the other planes need 4 bytes. Mostly used on Windows platforms and in JavaScript. For text streams, big endian and little endian byte order have to be distinguished.
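The byte counts above can be checked with the OCaml standard library alone. The following is a minimal sketch, not part of Fmlib_parse: the helper `encode` and the names `utf8`, `utf16_be` and `utf16_le` are made up for this illustration, and it assumes an OCaml version that provides `Buffer.add_utf_8_uchar`, `Buffer.add_utf_16be_uchar` and `Buffer.add_utf_16le_uchar`.

```ocaml
(* Illustration only: uses the standard library, not Fmlib_parse. *)

(* Encode one code point with the given Buffer function and return the
   resulting bytes as a string. [encode] is a hypothetical helper. *)
let encode add cp =
  let b = Buffer.create 4 in
  add b (Uchar.of_int cp);
  Buffer.contents b

let utf8 = encode Buffer.add_utf_8_uchar
let utf16_be = encode Buffer.add_utf_16be_uchar
let utf16_le = encode Buffer.add_utf_16le_uchar

let () =
  (* One ascii character, two other characters from the basic multilingual
     plane, and one character outside of it. *)
  List.iter
    (fun cp ->
      Printf.printf "U+%04X: utf8 %d bytes, utf16 %d bytes\n" cp
        (String.length (utf8 cp))
        (String.length (utf16_be cp)))
    [ 0x41; 0xE9; 0x20AC; 0x1F600 ]
```

The euro sign U+20AC shows the endianness issue: `utf16_be` yields the bytes 0x20 0xAC, while `utf16_le` yields 0xAC 0x20.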
There are the following modules available:
- Make_utf8: Parse text streams encoded in utf-8.
- Make_utf16_be: Parse text streams encoded in utf-16 big endian.
- Make_utf16_le: Parse text streams encoded in utf-16 little endian.
- Make: Parse text streams in any encoding. The encoder and decoder have to be provided as a module parameter.
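One reason for having a separate module per encoding is that the same bytes decode to different characters depending on the decoder. The sketch below illustrates this with the standard library only (it assumes OCaml 4.14 or later for `String.get_utf_8_uchar` and its utf-16 variants). It does not use or reproduce the codec interface of Fmlib_parse, and the helper `first` is a hypothetical name.

```ocaml
(* Illustration only: uses the standard library, not Fmlib_parse. *)

(* Decode the first character of [s] with the decoder [get]. Return the code
   point and the number of bytes consumed, or [None] if the bytes are
   malformed in that encoding. [first] is a hypothetical helper. *)
let first get s =
  let d = get s 0 in
  if Uchar.utf_decode_is_valid d then
    Some (Uchar.to_int (Uchar.utf_decode_uchar d), Uchar.utf_decode_length d)
  else
    None

(* The same two bytes, interpreted in three encodings. *)
let bytes = "\x20\xAC"

let as_utf8 = first String.get_utf_8_uchar bytes
let as_utf16_be = first String.get_utf_16be_uchar bytes
let as_utf16_le = first String.get_utf_16le_uchar bytes
```

Read as utf-16 big endian, the two bytes are the euro sign U+20AC. Read as utf-16 little endian, they are U+AC20. Read as utf-8, the first byte alone is the space character U+0020.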
All parsers in this module work like a character parser (see Character.Make) with some additional combinators to recognize unicode characters.
module Make_utf8
(State : Fmlib_std.Interfaces.ANY)
(Final : Fmlib_std.Interfaces.ANY)
(Semantic : Fmlib_std.Interfaces.ANY) :
sig ... end
Parse an input stream consisting of unicode characters encoded in utf-8.
module Make_utf16_be
(State : Fmlib_std.Interfaces.ANY)
(Final : Fmlib_std.Interfaces.ANY)
(Semantic : Fmlib_std.Interfaces.ANY) :
sig ... end
Parse an input stream consisting of unicode characters encoded in utf-16 big endian.
module Make_utf16_le
(State : Fmlib_std.Interfaces.ANY)
(Final : Fmlib_std.Interfaces.ANY)
(Semantic : Fmlib_std.Interfaces.ANY) :
sig ... end
Parse an input stream consisting of unicode characters encoded in utf-16 little endian.
module Make
(Codec : Interfaces.CHAR_CODEC)
(State : Fmlib_std.Interfaces.ANY)
(Final : Fmlib_std.Interfaces.ANY)
(Semantic : Fmlib_std.Interfaces.ANY) :
sig ... end
Parse an input stream consisting of unicode characters. The unicode characters are encoded and decoded by using the module Codec.