Skip to content

komm.FixedToVariableCode

General fixed-to-variable length code. A fixed-to-variable length code with source alphabet S\mathcal{S}, target alphabet T\mathcal{T}, and source block size kk is defined by an encoding mapping Enc:SkT+\Enc : \mathcal{S}^k \to \mathcal{T}^+, where the domain is the set of all kk-tuples with entries in S\mathcal{S}, and the co-domain is the set of all finite-length, non-empty tuples with entries in T\mathcal{T}. Here we assume that S=[0:S)\mathcal{S} = [0:S) and T=[0:T)\mathcal{T} = [0:T), for integers S2S \geq 2 and T2T \geq 2. The elements in the image of Enc\Enc are called codewords.

from_enc_mapping() classmethod

Constructs a fixed-to-variable length code from the encoding mapping Enc\Enc.

Parameters:

  • enc_mapping (dict[Word, Word])

    The encoding mapping Enc\Enc. Must be a dictionary whose keys are all the kk-tuples of integers in [0:S)[0:S) and whose values are non-empty tuples of integers in [0:T)[0:T).

Notes

The source block size kk is inferred from the domain of the encoding mapping, and the source and target cardinalities SS and TT are inferred from the maximum values in the domain and co-domain, respectively.

Examples:

>>> code = komm.FixedToVariableCode.from_enc_mapping({
...     (0,): (0,),
...     (1,): (1, 0),
...     (2,): (1, 1),
... })
>>> code.source_cardinality, code.target_cardinality, code.source_block_size
(3, 2, 1)
>>> code.codewords
[(0,), (1, 0), (1, 1)]
>>> code = komm.FixedToVariableCode.from_enc_mapping({
...     (0, 0): (0,),
...     (0, 1): (1, 1),
...     (1, 0): (1, 1, 0),
...     (1, 1): (1, 0, 1),
... })
>>> code.source_cardinality, code.target_cardinality, code.source_block_size
(2, 2, 2)
>>> code.codewords
[(0,), (1, 1), (1, 1, 0), (1, 0, 1)]

from_codewords() classmethod

Constructs a fixed-to-variable length code from the source cardinality SS and a list of codewords.

Parameters:

  • source_cardinality (int)

    The source cardinality SS. Must be an integer greater than or equal to 22.

  • codewords (list[Word])

    The codewords of the code. Must be a list of length SkS^k containing tuples of integers in [0:T)[0:T), where TT is the target cardinality of the code. The tuple in position ii must be equal to Enc(u)\Enc(u), where uu is the ii-th element in the lexicographic ordering of [0:S)k[0:S)^k.

Notes

The source block size kk is inferred from the length of the codewords, and the target cardinality TT is inferred from the maximum value in the codewords.

Examples:

>>> code = komm.FixedToVariableCode.from_codewords(
...     source_cardinality=3,
...     codewords=[(0,), (1,0), (1,1)],
... )
>>> code.source_cardinality, code.target_cardinality, code.source_block_size
(3, 2, 1)
>>> code.enc_mapping
{(0,): (0,),
 (1,): (1, 0),
 (2,): (1, 1)}
>>> code = komm.FixedToVariableCode.from_codewords(
...     source_cardinality=2,
...     codewords=[(0,), (1,1), (1,1,0), (1,0,1)]
... )
>>> (code.source_cardinality, code.target_cardinality, code.source_block_size)
(2, 2, 2)
>>> code.enc_mapping
{(0, 0): (0,),
 (0, 1): (1, 1),
 (1, 0): (1, 1, 0),
 (1, 1): (1, 0, 1)}

source_cardinality int cached property

The source cardinality SS of the code. It is the number of symbols in the source alphabet.

Examples:

>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.source_cardinality
3

target_cardinality int cached property

The target cardinality TT of the code. It is the number of symbols in the target alphabet.

Examples:

>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.target_cardinality
2

source_block_size int cached property

The source block size kk of the code. It is the number of symbols in each source block.

Examples:

>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.source_block_size
1

size int cached property

The number of codewords in the code. It is equal to SkS^k.

Examples:

>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.size
3

enc_mapping dict[Word, Word] cached property

The encoding mapping Enc\Enc of the code. It is a dictionary of length SkS^k whose keys are all the kk-tuples of integers in [0:S)[0:S) and whose values are the corresponding codewords.

Examples:

>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.enc_mapping
{(0,): (0,), (1,): (1, 0), (2,): (1, 1)}

codewords list[Word] cached property

The codewords of the code. They correspond to the image of the encoding mapping Enc\Enc.

Examples:

>>> code = komm.FixedToVariableCode.from_enc_mapping({
...     (0,): (0,),
...     (1,): (1, 0),
...     (2,): (1, 1),
... })
>>> code.codewords
[(0,), (1, 0), (1, 1)]

is_uniquely_decodable() cached

Returns whether the code is uniquely decodable. A code is uniquely decodable if there is a unique way to parse any concatenation of codewords.

Examples:

>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.is_uniquely_decodable()
True
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (0,1), (1,1)])
>>> code.is_uniquely_decodable()
True
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (0,1), (1,0)])
>>> code.is_uniquely_decodable()  # 010 can be parsed as 0|10 or 01|0
False

is_prefix_free() cached

Returns whether the code is prefix-free. A code is prefix-free if no codeword is a prefix of any other codeword.

Examples:

>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.is_prefix_free()
True
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (0,1), (1,1)])
>>> code.is_prefix_free()
False
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (0,1), (1,0)])
>>> code.is_prefix_free()
False

kraft_parameter() cached

Computes the Kraft parameter KK of the code. This quantity is given by K=uSkTu, K = \sum_{u \in \mathcal{S}^k} T^{-{\ell_u}}, where u\ell_u is the length of the codeword Enc(u)\Enc(u), TT is the target cardinality, and kk is the source block size.

Returns:

  • kraft_parameter (float)

    The Kraft parameter KK of the code.

Examples:

>>> code = komm.FixedToVariableCode.from_codewords(5, [(0,0,0), (0,0,1), (0,1,0), (1,0,1), (1,1)])
>>> code.kraft_parameter()
np.float64(0.75)
>>> code = komm.FixedToVariableCode.from_codewords(4, [(0,), (1,0), (1,1,0), (1,1,1)])
>>> code.kraft_parameter()
np.float64(1.0)
>>> code = komm.FixedToVariableCode.from_codewords(4, [(0,0), (1,1), (0,), (1,)])
>>> code.kraft_parameter()
np.float64(1.5)

rate()

Computes the expected rate RR of the code, considering a given pmf. This quantity is given by R=nˉk, R = \frac{\bar{n}}{k}, where nˉ\bar{n} is the expected codeword length, assuming iid source symbols drawn from pXp_X, and kk is the source block size. It is measured in TT-ary digits per source symbol.

Parameters:

  • pmf (ArrayLike)

    The (first-order) probability mass function pXp_X to be considered.

Returns:

  • rate (float)

    The expected rate RR of the code.

Examples:

>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.rate([1/2, 1/4, 1/4])
np.float64(1.5)

encode()

Encodes a sequence of source symbols using the code, which must be uniquely decodable.

Parameters:

  • input (ArrayLike)

    The sequence of symbols to be encoded. Must be a 1D-array with elements in [0:S)[0:S) (where SS is the source cardinality of the code) and have a length that is a multiple of the source block size kk.

Returns:

  • output (NDArray[integer])

    The sequence of encoded symbols. It is a 1D-array with elements in [0:T)[0:T) (where TT is the target cardinality of the code).

Examples:

>>> code = komm.FixedToVariableCode.from_enc_mapping({
...     (0, 0): (0,),
...     (0, 1): (1, 1),
...     (1, 0): (1, 0, 0),
...     (1, 1): (1, 0, 1),
... })
>>> code.encode([0, 1, 0, 0])
array([1, 1, 0])
>>> code.encode([0, 1, 0])  # Not a multiple of the source block size
Traceback (most recent call last):
...
ValueError: length of input must be a multiple of block size 2 (got 3)
>>> code.encode([0, 7, 0, 0])  # 07 is not a valid source word
Traceback (most recent call last):
...
ValueError: input contains invalid word
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (0,1), (1,0)])
>>> code.encode([0, 1, 0])  # Code is not uniquely decodable
Traceback (most recent call last):
...
ValueError: code is not uniquely decodable

decode()

Decodes a sequence of target symbols using the code, which must be uniquely decodable.

Warning

Decoding for non-prefix-free codes is not implemented yet.

Parameters:

  • input (ArrayLike)

    The sequence of symbols to be decoded. Must be a 1D-array with elements in [0:T)[0:T) (where TT is the target cardinality of the code). Also, the sequence must be a concatenation of codewords (i.e., the output of the encode method).

Returns:

  • output (NDArray[integer])

    The sequence of decoded symbols. It is a 1D-array with elements in [0:S)[0:S) (where SS is the source cardinality of the code) with a length that is a multiple of the source block size kk.

Examples:

>>> code = komm.FixedToVariableCode.from_enc_mapping({
...     (0, 0): (0,),
...     (0, 1): (1, 1),
...     (1, 0): (1, 0, 0),
...     (1, 1): (1, 0, 1),
... })
>>> code.decode([1, 1, 0])
array([0, 1, 0, 0])
>>> code.decode([0, 0, 1])  # Not a concatenation of codewords
Traceback (most recent call last):
...
ValueError: input contains invalid word
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (0,1), (1,0)])
>>> code.decode([0, 1, 0])  # Code is not uniquely decodable
Traceback (most recent call last):
...
ValueError: code is not uniquely decodable