komm.FixedToVariableCode
General fixed-to-variable length code. A fixed-to-variable length code with source alphabet , target alphabet , and source block size is defined by an encoding mapping , where the domain is the set of all -tuples with entries in , and the co-domain is the set of all finite-length, non-empty tuples with entries in . Here we assume that and , for integers and . The elements in the image of are called codewords.
from_enc_mapping()
classmethod
Constructs a fixed-to-variable length code from the encoding mapping .
Parameters:
-
enc_mapping
(dict[Word, Word]
) –The encoding mapping . Must be a dictionary whose keys are all the -tuples of integers in and whose values are non-empty tuples of integers in .
Notes
The source block size is inferred from the domain of the encoding mapping, and the source and target cardinalities and are inferred from the maximum values in the domain and co-domain, respectively.
Examples:
>>> code = komm.FixedToVariableCode.from_enc_mapping({
... (0,): (0,),
... (1,): (1, 0),
... (2,): (1, 1),
... })
>>> code.source_cardinality, code.target_cardinality, code.source_block_size
(3, 2, 1)
>>> code.codewords
[(0,), (1, 0), (1, 1)]
>>> code = komm.FixedToVariableCode.from_enc_mapping({
... (0, 0): (0,),
... (0, 1): (1, 1),
... (1, 0): (1, 1, 0),
... (1, 1): (1, 0, 1),
... })
>>> code.source_cardinality, code.target_cardinality, code.source_block_size
(2, 2, 2)
>>> code.codewords
[(0,), (1, 1), (1, 1, 0), (1, 0, 1)]
from_codewords()
classmethod
Constructs a fixed-to-variable length code from the source cardinality and a list of codewords.
Parameters:
-
source_cardinality
(int
) –The source cardinality . Must be an integer greater than or equal to .
-
codewords
(list[Word]
) –The codewords of the code. Must be a list of length containing tuples of integers in , where is the target cardinality of the code. The tuple in position must be equal to , where is the -th element in the lexicographic ordering of .
Notes
The source block size is inferred from the length of the codewords, and the target cardinality is inferred from the maximum value in the codewords.
Examples:
>>> code = komm.FixedToVariableCode.from_codewords(
... source_cardinality=3,
... codewords=[(0,), (1,0), (1,1)],
... )
>>> code.source_cardinality, code.target_cardinality, code.source_block_size
(3, 2, 1)
>>> code.enc_mapping
{(0,): (0,),
(1,): (1, 0),
(2,): (1, 1)}
>>> code = komm.FixedToVariableCode.from_codewords(
... source_cardinality=2,
... codewords=[(0,), (1,1), (1,1,0), (1,0,1)]
... )
>>> (code.source_cardinality, code.target_cardinality, code.source_block_size)
(2, 2, 2)
>>> code.enc_mapping
{(0, 0): (0,),
(0, 1): (1, 1),
(1, 0): (1, 1, 0),
(1, 1): (1, 0, 1)}
source_cardinality
int
cached
property
The source cardinality of the code. It is the number of symbols in the source alphabet.
Examples:
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.source_cardinality
3
target_cardinality
int
cached
property
The target cardinality of the code. It is the number of symbols in the target alphabet.
Examples:
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.target_cardinality
2
source_block_size
int
cached
property
The source block size of the code. It is the number of symbols in each source block.
Examples:
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.source_block_size
1
size
int
cached
property
The number of codewords in the code. It is equal to .
Examples:
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.size
3
enc_mapping
dict[Word, Word]
cached
property
The encoding mapping of the code. It is a dictionary of length whose keys are all the -tuples of integers in and whose values are the corresponding codewords.
Examples:
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.enc_mapping
{(0,): (0,), (1,): (1, 0), (2,): (1, 1)}
codewords
list[Word]
cached
property
The codewords of the code. They correspond to the image of the encoding mapping .
Examples:
>>> code = komm.FixedToVariableCode.from_enc_mapping({
... (0,): (0,),
... (1,): (1, 0),
... (2,): (1, 1),
... })
>>> code.codewords
[(0,), (1, 0), (1, 1)]
is_uniquely_decodable()
cached
Returns whether the code is uniquely decodable. A code is uniquely decodable if there is a unique way to parse any concatenation of codewords.
Examples:
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.is_uniquely_decodable()
True
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (0,1), (1,1)])
>>> code.is_uniquely_decodable()
True
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (0,1), (1,0)])
>>> code.is_uniquely_decodable() # 010 can be parsed as 0|10 or 01|0
False
is_prefix_free()
cached
Returns whether the code is prefix-free. A code is prefix-free if no codeword is a prefix of any other codeword.
Examples:
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.is_prefix_free()
True
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (0,1), (1,1)])
>>> code.is_prefix_free()
False
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (0,1), (1,0)])
>>> code.is_prefix_free()
False
kraft_parameter()
cached
Computes the Kraft parameter of the code. This quantity is given by where is the length of the codeword , is the target cardinality, and is the source block size.
Returns:
-
kraft_parameter
(float
) –The Kraft parameter of the code.
Examples:
>>> code = komm.FixedToVariableCode.from_codewords(5, [(0,0,0), (0,0,1), (0,1,0), (1,0,1), (1,1)])
>>> code.kraft_parameter()
np.float64(0.75)
>>> code = komm.FixedToVariableCode.from_codewords(4, [(0,), (1,0), (1,1,0), (1,1,1)])
>>> code.kraft_parameter()
np.float64(1.0)
>>> code = komm.FixedToVariableCode.from_codewords(4, [(0,0), (1,1), (0,), (1,)])
>>> code.kraft_parameter()
np.float64(1.5)
rate()
Computes the expected rate of the code, considering a given pmf. This quantity is given by where is the expected codeword length, assuming iid source symbols drawn from , and is the source block size. It is measured in -ary digits per source symbol.
Parameters:
-
pmf
(ArrayLike
) –The (first-order) probability mass function to be considered.
Returns:
-
rate
(float
) –The expected rate of the code.
Examples:
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (1,0), (1,1)])
>>> code.rate([1/2, 1/4, 1/4])
np.float64(1.5)
encode()
Encodes a sequence of source symbols using the code, which must be uniquely decodable.
Parameters:
-
input
(ArrayLike
) –The sequence of symbols to be encoded. Must be a 1D-array with elements in (where is the source cardinality of the code) and have a length that is a multiple of the source block size .
Returns:
-
output
(NDArray[integer]
) –The sequence of encoded symbols. It is a 1D-array with elements in (where is the target cardinality of the code).
Examples:
>>> code = komm.FixedToVariableCode.from_enc_mapping({
... (0, 0): (0,),
... (0, 1): (1, 1),
... (1, 0): (1, 0, 0),
... (1, 1): (1, 0, 1),
... })
>>> code.encode([0, 1, 0, 0])
array([1, 1, 0])
>>> code.encode([0, 1, 0]) # Not a multiple of the source block size
Traceback (most recent call last):
...
ValueError: length of input must be a multiple of block size 2 (got 3)
>>> code.encode([0, 7, 0, 0]) # 07 is not a valid source word
Traceback (most recent call last):
...
ValueError: input contains invalid word
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (0,1), (1,0)])
>>> code.encode([0, 1, 0]) # Code is not uniquely decodable
Traceback (most recent call last):
...
ValueError: code is not uniquely decodable
decode()
Decodes a sequence of target symbols using the code, which must be uniquely decodable.
Warning
Decoding for non-prefix-free codes is not implemented yet.
Parameters:
-
input
(ArrayLike
) –The sequence of symbols to be decoded. Must be a 1D-array with elements in (where is the target cardinality of the code). Also, the sequence must be a concatenation of codewords (i.e., the output of the
encode
method).
Returns:
-
output
(NDArray[integer]
) –The sequence of decoded symbols. It is a 1D-array with elements in (where is the source cardinality of the code) with a length that is a multiple of the source block size .
Examples:
>>> code = komm.FixedToVariableCode.from_enc_mapping({
... (0, 0): (0,),
... (0, 1): (1, 1),
... (1, 0): (1, 0, 0),
... (1, 1): (1, 0, 1),
... })
>>> code.decode([1, 1, 0])
array([0, 1, 0, 0])
>>> code.decode([0, 0, 1]) # Not a concatenation of codewords
Traceback (most recent call last):
...
ValueError: input contains invalid word
>>> code = komm.FixedToVariableCode.from_codewords(3, [(0,), (0,1), (1,0)])
>>> code.decode([0, 1, 0]) # Code is not uniquely decodable
Traceback (most recent call last):
...
ValueError: code is not uniquely decodable