1. Notation

We denote by $F$ to a finite field of prime order $p$ and $F^{\times}$ to its respective multiplicative group and define $k$ to be the biggest non-negative integer such that $2^{k} ∣ (p - 1)$ . We also write $K$ to denote a finite field extension of $F$ , of size $p^{e}$ , $e \geq 2$ . Furthermore, we write $F [X]$ (resp. $K [X]$ ) for the ring of polynomials with coefficients over $F$ (resp. $K$ ) and write $F_{< d} [X]$ (resp. $F_{< d} [X]$ ) to denote the set of polynomials of degree lower than $d$ .

2.Multiset Equality

The details and examples can be found in Permutation argument.

Grand Product

let $G = ⟨ g ⟩$ is a cyclic subgroup of $F^{\times}$ of order n.

Given two vectors $f = (f_{1}, \dots, f_{n})$ and $t = (t_{1}, \dots, t_{n})$ in $F^{n}$ , a multiset equality argument, denoted $f ≐ t$ , is used for checking that $f$ is equal to $t$ as multisets (or equivalently, that $f$ and $t$ are a permutation of each other). The protocol that instantiates the multiset equality arguments works by computing the following grand product polynomial $Z \in K_{< n} [X]$ : $Z (g^{i}) = {1, if i = 1 \prod_{j = 1}^{i - 1} \frac{f _{j} + γ}{t _{j} + γ}, if i = 2, \dots, n (1)$ where $γ \in K$ is the value sent from the verifier.

Soundness of Multiset Equality

Lemma 1: Fix two vectors $f = (f_{1}, \dots, f + n)$ and $t = (t_{1}, \dots, t_{n})$ in $F^{n}$ . If the following holds with probability larger than $ε_{M u lEq} (n) := n /∣ K ∣$ over a random $γ \in K$ : $j = 1 \prod n (f_{i} + γ) = i = 1 \prod n (t_{i} + γ) (2)$ Then $f ≐ t$ . As a consequence of Lemma 1, the identities that must be checked by the verifier for $x \in G$ are the following:

$L_{1} (x) \cdot (Z (x) - 1) = 0 Z (x \cdot g) \cdot (t (x) + γ) = Z (x) \cdot (f (x) + γ) (3)$ where $f, t \in F_{< n} [X]$ are the polynomials resulting from the interpolation of ${f_{i}}_{i \in [n]}$ and ${t_{i}}_{i \in [n]}$ over $G$ ,respectively.

The commitment polynomial is the grand product polynomial $Z \in K_{< n} [X]$ . and the two constraints is express in Eq $(3)$ .

3.Connection

The protocol for a connection argument and the definitions and results we provide next are adapted from GWC19. The details and examples can be found in Connection argument.

Given some vectors $f_{1}, \dots, f_{k} \in F^{n}$ and a partition $T = {T_{1}, \dots, T_{s}}$ of the set $[kn]$ , a connection argument, denoted $(f_{1}, \dots, f_{k}) \propto {T_{1}, \dots, T_{s}}$ , is used to check that the partition $T$ divides the field elements ${f_{i, j}}_{i \in [k], j \in [n}]$ into sets with the same value. More specifically, if we define the sequence $f_{(1)}, \dots, f_{(kn)} \in F$ by:

$f_{((i - 1) n + j)} := f_{i, j} (4)$ for each $i \in [k], j \in [n]$ , then we have $f_{(l 1)} = f_{(l 2)}$ if and only if $l_{1}, l_{2}$ belong to the same block of $T$ .

In order to express the partition $T$ within a grand product polynomial, we define a permutation $σ : [kn] \to [kn]$ as follows: $σ$ is such that for each block $T_{i}$ of $T$ , σ(T ) contains a cycle going over all elements of $T_{i}$ . Then, the protocol that instantiates the connection arguments works by computing the following grand product polynomial $Z \in K_{< n} [X]$ :

$Z (g^{i}) = {1, if i = 1 \prod_{l = 1}^{k} \prod_{j = 1}^{i - 1} \frac{f _{l, j} + γ \cdot (( l - 1 ) \cdot n + j ) + δ}{f _{l, j} + γ \cdot σ (( l - 1 ) \cdot n + j ) + δ )}, if i = 2, \dots, n (5)$ where $γ, δ \in K$ are the values sent from the verifier.

The definition of the previous polynomial is based on the following lemma, a proof of which can be found in Claim A.1. of GWC19.

Lemma 2 Fix $f_{1}, \dots, f_{k} \in F^{n}$ and a partition $T = {T_{1}, \dots, T_{s}}$ of the set $[kn]$ . If the following holds with probability larger than $ε_{C o n} (n) := kn /∣ K ∣$ over a randoms $γ, δ \in K$ : $l = 1 \prod k j = 1 \prod n (f_{l, j} + γ \cdot ((l - 1) \cdot n + j) + δ) = l = 1 \prod k j = 1 \prod n (f_{l, j} + γ \cdot σ ((l - 1) \cdot n + j) + δ) (6)$ As a consequence of Lemma 2, the identities that must be checked by the verifier for $x \in G$ are the following: $L_{1} (x) \cdot (Z (x) - 1) = 0, Z (x \cdot g) \cdot l = 1 \prod k (f_{l} (x) + γ \cdot S_{σ_{l}} (x) + δ) = Z (x) \cdot l = 1 \prod k (f_{l} (x) + γ \cdot S_{I D_{l}} (x) + δ), (7)$ where $S_{I D_{i}} (g^{j}) = (i - 1) \cdot n + j$ is the polynomial mapping G-elements to indexes in [kn] and $S_{σ_{i}} (g^{j}) = σ ((i - 1) \cdot n + j)$ is the polynomial defined by $σ$ . For more details see GWC19.

The commitment polynomial is the grand product polynomial $Z \in K_{< n} [X]$ . and the two constraints is express in Eq $(7)$ .

4.inclusion

The protocol for an inclusion argument and the definitions and results we provide next is adapted from the well-known Plookup protocol GW20, with the “alternating method” provided in [PFM+22](https://eprint.iacr.org/ 2022/086). The details and examples can be found in Inclusion argument.

Given two vectors $f = (f_{1}, \dots, f_{n})$ and $t = (t_{1}, \dots, t_{n})$ in $F^{n}$ , a inclusion argument, denoted $f \in t$ , is used for checking that the set A formed with the values ${f_{i}}_{i \in [n]}$ is contained in the set B formed with the values ${t}_{i \in [n]}$ . Notice that $∣ A ∣, ∣ B ∣ \leq n$ .

In the protocol, the prover has to construct an auxiliary vector $s = (s_{1}, \dots, s_{2 n})$ containing every element of f and t where the order of appearance is the same as in $t$ . The main idea behind the protocol is that if $f \in t$ , then f contributes to s with repeated elements. To check this fact, a vector $Δ s$ is defined as follows: $Δ s = (s_{1} + γ s_{2}, s_{2} + γ s_{3}, \dots, s_{2 n} + γ s_{1}) (8)$ Then, the protocol essentially checks that $Δ s$ is consistent with the elements of f, t and s. To do so, the vector s is split into two vectors $h_{1}, h_{2} \in F^{n}$ . In the protocol described in GW20, $h_{1}$ and $h_{2}$ contain the lower and upper halves of s, while in our protocol in [PFM+22](https://eprint.iacr.org/ 2022/086), we use $h_{1}$ to store elements with odd indexes and $h_{2}$ for even indexes, that is: $h_{1} = (s_{1}, s_{3}, s_{5}, \dots, s_{2 n - 1}) and h_{2} = (s_{2}, s_{4}, s_{6}, \dots, s_{2 n}) (9)$ With this setting in mind, the grand product polynomial is deﬁned as: $Z (g^{i}) = {1, if i = 1 (1 + γ)^{i - 1} \prod_{j = 1}^{i - 1} \frac{( δ + f _{j} ) ( δ ( 1 + γ ) + t _{j} + γ t _{j + 1} )}{( δ ( 1 + γ ) + s _{2 j - 1} + γ s _{2 j} ) ( γ ( 1 + γ ) + s _{2 j} + γ s _{2 j + 1} )}, if i = 2, \dots, n (10)$ where $γ, δ \in K$ are the values sent from the verifier.

The definition of the previous polynomial is based on the following lemma, which is a slight modification of Claim 3.1. of GW20.

Lemma 3 (Soundness of Inclusion). Fix three vectors $f = (f_{1}, \dots, f_{n}), t = (t_{1}, \dots ., t_{n})$ and $s = (s_{1}, \dots, s_{2 n})$ with elements in $F$ . If the following holds with probability larger than $ε_{I n c} (n) := \frac{( 4 n - 2 )}{∣ K ∣}$ over randoms $γ, δ \in K$ : $(1 + γ)^{n} i = 1 \prod n (γ + f_{i}) i = 1 \prod n - 1 (δ (1 + γ) + t_{i} + γ t_{i + 1}) = i = 1 \prod 2 n - 1 (δ (1 + γ) + s_{i} + γ s_{i + 1}) (11)$ then $f \in t$ and $s$ is the sorted by $t$ concatenation of $f$ and $t$ .

As a consequence of Lemma 3, the identities that must be checked by the verifier for $x \in G$ are the following: $L_{1} (x) (Z (x) - 1) = 0, Z (x \cdot g) = Z (x) \frac{( 1 + γ ) ( δ + f ( x )) ( δ ( 1 + γ ) + t ( x ) + γ t ( gx ))}{( δ ( 1 + γ ) + h _{1} ( x ) + γ h _{2} ( x )) ( δ ( 1 + γ ) + h _{2} ( x ) + γ h _{1} ( x \cdot g ))} (12)$ where $f, t \in F_{< n} [X]$ are the polynomials resulting from the interpolation of ${f_{i}}_{i \in [n]}$ and ${t_{i}}_{i \in [n]}$ over $G$ , respectively; $h_{1}$ and $h_{2} \in F_{< n} [X]$ are the polynomials resulting from the interpolation of the values defined in Eq $(8)$ over $G$ .

The commitment polynomial is the grand product polynomial $Z \in K_{< n} [X]$ and $h_{1}$ and $h_{2} \in F_{< n} [X]$ . and the two constraints is express in Eq $(12)$ .

5.From Vector Arguments to Simple Arguments

Let’s explain first how we reduce vector inclusions or multiset equalities to simple (i.e., involving only one polynomial on each side) inclusions or multiset equalities.

Definition 1 (Vector Arguments). Given polynomials $f_{i}, t_{i} \in K_{< n} [X]$ for $i \in [N]$ , a vector inclusion, denoted $(f_{1}, \dots, f_{N}) \in (t_{1}, \dots, t_{N})$ , is the argument in which for all $x \in G$ there exists some $y \in G$ such that: $(f_{1} (x), \dots, f_{N} (x)) = (t_{1} (x), \dots, t_{N} (x)) (13)$ A vector multiset equality, denoted $(f_{1}, \dots, f_{N}) ≐ (t_{1}, \dots, t_{N})$ , is the argument in which for all $y \in G$ there exists exactly one $x \in G$ for which Eq $(13)$ holds. That is, (vector) multiset equalities define a bijective mapping.

To reduce the previous vector arguments to simple ones, we make use of a uniformly sampled element $α \in K$ . Namely, instead of trying to generate an argument for the vector relation, we define the following polynomials: $F^{'} (x) := i = 1 \sum N α^{i - 1} f_{i} (X), T^{'} (x) := i = 1 \sum N α^{i - 1} t_{i} (X), (14)$ and proceed to prove the relation $F^{'} \in T^{'}$ or $F^{'} ≐ T^{'}$ . Notice that both $F^{'}$ and $T^{'}$ are in general polynomials with coefficients over the field extension $K$ even if every coefficient of $f_{i}$ , $t_{i}$ is precisely over the base field $F$ .

The previous reduction leads to the following result.

Lemma 4. Given polynomials $f_{i}, t_{i} \in K_{< n} [X]$ for $i \in [N]$ and $F^{'}, T^{'} \in K_{< n} [X]$ as defined by Eq. $(14)$ , if $F^{'} \in T^{'}$ (resp. $F^{'} ≐ T^{'}$ ), then $(f_{1}, \dots, f_{N}) \in (t_{1}, \dots, t_{N})$ (resp. $(f_{1}, \dots, f_{N}) ≐ (t_{1}, \dots, t_{N})$ ) except with probability $n \cdot (N - 1) /∣ K ∣$ over the random choice of $α$ .

6.From Selected Vector Arguments to Simple Arguments

Now, let’s go one step further by the introduction of selectors. Informally speaking, a selected inclusion (multiset equality) is an inclusion (multiset equality) not between the specified two polynomials $f$ , $t$ , but between the polynomials generated by the multiplication of $f$ and $t$ with (generally speaking) independently generated selectors. We generalize to the vector setting.

Definition 2 (Selected Vector Arguments). We are given polynomials $f_{i}, t_{i} \in K_{< n} [X]$ for $i \in [N]$ . Furthermore, we are also given two polynomials $f^{se l}, t^{se l} \in K_{< n} [X]$ whose range over the domain $G$ is {0,1}. That is, $f^{se l}$ and $t^{se l}$ are selectors. A selected vector inclusion, denoted $f^{se l} \cdot (f_{1}, \dots, f_{N}) \in t^{se l} \cdot (t_{1}, \dots, t_{N})$ , is the argument in which for all $x \in G$ there exists some $y \in G$ such that: $f^{se l} (x) \cdot (f_{1} (x), \dots, f_{N} (x)) \in t^{se l} (y) \cdot (t_{1} (y), \dots, t_{N} (y)) (15)$ where $f^{se l} (x) \cdot (f_{1} (x), \dots, f_{N} (x))$ denotes the component-wise scalar multiplication between the field element $f^{se l} (x)$ and the vector $(f_{1} (x), \dots, f_{N} (x))$ .

A selected vector multiset equality, denoted $f^{se l} \cdot (f_{1}, \dots, f_{N}) ≐ t^{se l} \cdot (t_{1}, \dots, t_{N})$ . is the argument in which for all $y \in G$ there exists exactly one $x \in G$ for which Eq. (15) holds.

Remark 1. Note that if $f^{se l} = t^{se l} = 1$ , then Eq. (15) is reduced to (13); if $f^{se l} = t^{se l} = 0$ then the argument is trivial; and if either $f^{se l}$ or $t^{se l}$ is equal to the constant 1, then we remove the need for $f^{se l}$ or $t^{se l}$ respectively.

To reduce selected vector inclusion to simple ones, we proceed in two steps. First, we use the reduction shown in Eq. $(14)$ to reduce the inner vector of polynomials to a single one. This process outputs polynomials $F^{'}, T^{'} \in K_{< n} [X]$ . Second, we make use of another uniformly sampled $β \in F$ as follows. Namely, we define the following polynomials: $T (X) := t^{se l} (X) (T^{'} (X) - β) + β, F (X) := f^{se l} (X) (F^{'} (X) - T^{'} (X)) + T^{'} (X), (16)$ and proceed to prove the relation $F \in T$ .

Importantly, the presentation “re-ordering” in Eq. $(16)$ is relevant: if $β$ had been introduced in the definition of $F$ instead, then there would be situations in which we would end up having $β$ as an inclusion value and therefore the inclusion argument not being satisfied even if the selectors are correct. See Example 1 to see why this is relevant.

Example 1. Choose $N = 1, n = 2^{3}$ . We compute the following values:

$x$	$f_{1} (x)$	$F^{'} (x)$	$f^{se l} (x)$	$F (x)$	$t_{1} (x)$	$T^{'} (x)$	$t^{se l} (x)$	$T (x)$
$g$	3	3	0	1	$1$	$1$	$1$	$1$
$g^{2}$	$7$	$7$	$1$	$7$	1	1	0	$β$
$g^{3}$	4	4	0	7	$7$	$7$	$1$	$7$
$g^{4}$	$1$	$1$	$1$	$1$	6	6	0	$β$
$g^{5}$	$5$	$5$	$1$	$5$	$5$	$5$	$1$	$5$
$g^{6}$	1	1	0	5	$5$	$5$	$1$	$5$
$g^{7}$	$2$	$2$	$1$	$2$	5	5	0	$β$
$g^{8}$	$5$	$5$	$1$	$5$	$2$	$2$	$1$	$2$

Notice how $F \in T$ . However, if we would have instead defined $F$ , $T$ as $F (X) = f^{se l} (X) [F ’ (X) - β] + β$ and $T (X) = t^{se l} (X) [T ’ (X) - F (X)] + F (X)$ then we would end up having $β$ as a inclusion value, which implies that $F \in / T$ even though $f_{1}$ , $t_{1}$ and $f^{se l}$ , $t^{se l}$ are correct.

To reduce selected vector multiset equalities to simple ones, we follow a similar process as with selected vector inclusions. We also first use the reduction in Eq. $(14)$ to reduce the inner vector argument to a simple one, but then we define the following polynomials: $T (X) := t^{se l} (X) (T^{'} (X) - β) + β F (X) := f^{se l} (X) (F^{'} (X) - β) + β (17)$ and proceed to prove the relation $F \in T$ . Here, we have been able to first define $F$ since we are dealing with multiset equalities instead of inclusions.

Similarly to Lemma 4, we obtain the following result. by observing that $β$ do not grow the total degree of polynomials $F, T$ (either from Eq. $(16)$ or Eq. $(17)$ ) over variables $α, β$ .

We generalize to selected vector arguments the protocols for (simple) inclusion arguments and multiset equality arguments explained in Section 2 and 4 by incorporating the reduction strategies explained in this section. Therefore, we give next the soundness bounds for these protocols.

Lemma 5. Given polynomials $f_{i}, t_{i} \in K_{< n} [X]$ for $i \in [N]$ and selectors $f^{se l}, t^{se l} \in K_{< n} [X]$ , we obtain:

Inclusion Protocol. Let $T \in K_{< 2 n - 1} [X]$ and $F \in K_{< 3 n - 1} [X]$ as defined by Eq. $(16)$ . The prover sends oracle functions $[f_{i}], [t_{i}], [f^{se l}], [t^{se l}]$ for $i \in [N]$ to the verifier in the first round, who responds with uniformly sampled $α, β \in K$ . Enlarge the set of identities that must be checked by the verifier in the inclusion protocol of Section 4 with:

$f^{se l} (x) (f_{se l} (x) - 1) = 0, t^{se l} (x) (t^{se l} (x) - 1) = 0, (18)$

for all $x \in G$ , i.e., the verifier checks that polynomials $f^{se l}, t^{se l}$ are valid selectors.

Multiset Equality Protocol. Let $F, T \in K_{< 2 n - 1} [X]$ as defined by Eq. $(17)$ . The prover sends oracle functions $[f_{i}], [t_{i}], [f^{se l}], [t^{se l}]$ for $i \in [N]$ to the verifier in the first round, who responds with uniformly sampled $α, β \in K$ . Enlarge the set of identities that must be checked by the verifier in the multiset equality protocol of Section 2 with: $f^{se l} (x) (f_{se l} (x) - 1) = 0, t^{se l} (x) (t^{se l} (x) - 1) = 0, (19)$ for all $x \in G$ .

Example 2. Say that for all $x \in G$ the prover wants to prove that he knows some polynomials $t r_{1}, t r_{2}, t r_{3}, t r_{4}, t r_{5} \in F_{< n - 1} [X]$ such that: $t r_{1} \in t r_{3}, t r_{3} ≐ t r_{4}, (t r_{2}, t r_{1}, t r_{5}) \propto (S_{σ_{1}}, S_{σ_{2}}, S_{σ_{3}}), (20)$ Following the previous section and this section, the polynomial constraint system $(20)$ gets transformed to the following one, so that for all $x \in G$ : $L_{1} (x) (Z_{1} (x) - 1) = 0, Z_{1} (gx) = Z_{1} (x) \frac{( 1 + β ) ( γ + tr _{1} ( x )) ( γ ( 1 + β ) + tr _{3} ( x ) + β tr _{3} ( gx ))}{( γ ( 1 + β ) + h _{1, 1} ( x ) + β h _{1, 2} ( x )) ( γ ( 1 + β ) + h _{1, 2} ( x ) + β h _{1, 1} ( gx ))} L_{1} (x) (Z_{2} (x) - 1) = 0, Z_{2} (gx) = Z_{2} (x) \frac{( γ + tr _{3} ( x ))}{( γ + tr _{4} ( x ))}, L_{1} (x) (Z_{3} (x) - 1) = 0, im_{1} (x) = (tr_{1} (x) + β k_{1} x + γ) (tr_{5} (x) + β k_{2} x + γ), im_{2} (x) = (tr_{1} (x) + S_{σ_{2}} (x) + γ) (tr_{5} (x) + S_{σ_{3}} (x) + γ), Z_{3} (gx) = Z_{3} (x) \frac{( tr _{2} ( x ) + β x + γ ) im _{1} ( x )}{( tr _{2} ( x ) + S _{σ_{1}} ( x ) + γ ) im _{2} ( x )}, (21)$ where we notice that the only type of argument that sometimes need to be adjusted is the connection argument.

7.On the Quotient Polynomial

We obtain a single random value $α \in K$ and define the quotient polynomial as a random linear combination of the rational functions $q_{i}$ as follows: $Q (X) = i = 0 \sum ℓ α^{i - 1} q_{i} (X) (22)$ Note that we use powers of a uniformly sampled value $α$ instead of sampling one value per constraint. Importantly, the soundness bound of this alternative version is linearly increased by the number of

constraints ℓ, so we might assume from now on that $ℓ$ is sublinear in $∣ K ∣$ to ensure the security of protocols.

8. Controlling the Constraint Degree with Intermediate Polynomials

In the vanilla STARK protocol, the initial set of constraints that one attest to compute the proof over is of unbounded degree. However, when one arrives at the point after computing the quotient polynomial $Q$ , it should be split into polynomials of degree lower than $n$ to ensure the same redundancy is added as with the trace column polynomials $t r_{i}$ for a sound application of the FRI protocol. In this section, we explain an alternative for this process and propose the split to happen “at the beginning” and not “at the end” of the proof computation.

Therefore, we will proceed with this approach assuming that the arguments in Section 2-4 are included among the initial set of constraints. The constraints imposed by the grand products polynomials $Z_{i}$ of multiset equalities and inclusions are of known degree: degree 2 for the former and degree 3 for the latter. Based on this information, we will propose a splitting procedure that allows for polynomial constraints up to degree 3 but will split any exceeding it.

Say the initial set of polynomial constraints $C = C_{1}, \dots ., C_{ℓ}$ contain a constraint of total degree greater or equal to 4. For instance, say that we have $C = C_{1}, C_{2}$ with: $C_{1} (X_{1}, X_{2}, X_{3}, X_{1}^{'}, X_{2}^{'}, X_{3}^{'}) = X_{1} \cdot X_{2} \cdot X_{2}^{'} \cdot X_{3}^{'} - X_{3}^{3}, C_{2} (X_{1}, X_{2}, X_{3}, X_{1}^{'}, X_{2}^{'}, X_{3}^{'}) = X_{2} - 7 \cdot X_{1}^{'} + X_{3}^{'}, (23)$ Now, instead of directly computing the (unbounded) quotient polynomial $Q$ and then doing the split, we will follow the following process:

Split the constraints of degree $t \geq 4$ into $⌈ t /3 ⌉$ constraints of degree lower or equal than 3 through the introduction of one formal variable and one constraint per split.
Compute the rational functions $q_{i}$ . Notice the previous step restricts the degree of the $q_{i}$ ‘s to be lower than 2 $n$ .
Compute the quotient polynomial $Q \in F_{< 2 n} [X]$ and then split it into (at most) two polynomials $Q_{1}$ and $Q_{2}$ of degree lower than $n$ as follows: $Q (X) = Q_{1} (X) + X^{n} \cdot Q_{2} (X) (24)$ where $Q_{1}$ is obtained by taking the first $n$ coefficients of $Q$ and $Q_{2}$ is obtained by taking the last $n$ coefficients (filling with zeros if necessary).

Remark 2*.* Here, we might have that $Q_{2}$ is identically equal to 0. This is in contrast with the technique used for the split in Vanilla STARK, where the quotient polynomial $Q$ is distributed uniformly across each of the trace quotient polynomials $Q_{i}$ .

This process will “control” the degree of $Q$ so that it will be always of a degree lower than 2 $n$ .

Following with the example in Eq. $(23)$ , we rename $C_{2}$ to $C_{3}$ and introduce the formal variable $Y_{1}$ and the constraint: $C_{2} (X_{1}, X_{2}, X_{3}, X_{1}^{'}, X_{2}^{'}, X_{3}^{'}) = X_{1} \cdot X_{2} - Y_{1} (25)$ Now, to compute the rational functions $q_{i}$ , we have to compose $C_{2}$ not only with the trace column polynomials $tr_{i}$ but also with additional polynomials corresponding with the introduced variables $Y_{i}$ . We will denote these polynomials as $im_{i}$ and refer to them as intermediate polynomials.

Hence, the set of constraints in $(23)$ gets augmented to the following set: $C_{1} (X_{1}, X_{2}, X_{3}, X_{1}^{'}, X_{2}^{'}, X_{3}^{'}, Y_{1}) = Y_{1} \cdot X_{2}^{'} \cdot X_{3}^{'} - X_{3}^{3}, C_{2} (X_{1}, X_{2}, X_{3}, X_{1}^{'}, X_{2}^{'}, X_{3}^{'}, Y_{1}) = X_{1} \cdot X_{2} - Y_{1}, C_{3} (X_{1}, X_{2}, X_{3}, X_{1}^{'}, X_{2}^{'}, X_{3}^{'}, Y_{1}) = X_{2} - 7 \cdot X_{1}^{'} + X_{3}^{'}, (26)$ where we include the variable $Y_{1}$ in $C_{3}$ for notation simplicity. Note that now what we have is two constraints of degree lower than 3, but we have added one extra variable and constraint to take into account.

Discussing more in-depth the tradeoff generated between the two approaches, we have for one side that deg( $Q$ ) = $max_{i} {deg (q_{i})} = max_{i} {deg (C_{i}) (n - 1) - ∣ G ∣}$ . Denote by $i_{max}$ the index of the $q_{i}$ where the maximum is attained. Then, the number of polynomials $S$ in the split of $Q$ is equal to: $⌈ \frac{deg ( Q )}{n} ⌉ = ⌈ \frac{deg ( C _{i_{max}} ) ( n - 1 ) - ∣ G ∣}{n} ⌉ = deg (C_{i_{max}}) + ⌈ - \frac{∣ G ∣}{n} ⌉$ which is equal to either deg( $C_{i_{max}}$ ) − 1 or deg( $C_{i_{max}}$ ).

We must compare this number with the number of additional constraints (or polynomials) added in our proposal. So, on the other side, we have that the overall number ofconstraints $ℓ$ is: $i = 1 \sum ℓ ⌈ \frac{deg ( C _{i} )}{3} ⌉$ With $ℓ \geq ℓ$ .

We conclude that the appropriate approach should be chosen based on the minimum value between $ℓ \geq ℓ$ and $S$ .

Example 3. To give some concrete numbers, let us compare both approaches using the following set of constraints: $C_{1} (X_{1}, X_{2}, X_{3}, X_{4}, X_{1}^{'}) = X_{1} \cdot X_{2}^{2} \cdot X_{3}^{4} \cdot X_{4} - X_{1}^{'}, C_{2} (X_{1}, X_{2}, X_{3}) = X_{1} \cdot X_{2}^{3} + X_{3}^{3}, C_{3} (X_{2}, X_{3}, X_{4}, X_{2}^{'}) = X_{2}^{3} \cdot X_{3} \cdot X_{4} + X_{2}^{'}, (27)$ In the vanilla STARK approach, we obtain $S = 8$ . On the other side, using the early splitting technique explained before, by substituting $X_{1} \cdot X_{2}^{2}$ by $Y_{1}$ and $X_{2} \cdot X_{3} \cdot X_{4}$ by $Y_{2}$ we transform the previous set of constraints into an equivalent one having all constraints of degree less or equal than 3. This reduction only introduces 2 additional constraints: $C_{1} (X_{1}^{'}, Y_{1}, Y_{2}) = Y_{1}^{2} \cdot Y_{2} - X_{1}^{'}, C_{2} (X_{2}, X_{3}, Y_{1}) = Y_{1} \cdot X_{2} + X_{3}^{3}, C_{3} (X_{2}, X_{2}^{'}, Y_{2}) = Y_{2} \cdot X_{2}^{2} + X_{2}^{'}, C_{4} (X_{1}, X_{2}, Y_{1}) = Y_{1} - X_{1} \cdot X_{2}^{2}, C_{5} (X_{2}, X_{3}, X_{4}, Y_{2}) = Y_{2} - X_{2} \cdot X_{3} \cdot X_{4}, (28)$ Henceforth, the early splitting technique is convenient in this case, introducing 3 new polynomials instead of the 7 that proposes the vanilla STARK approach.

ZK Book