When researching about PQ KEM's I have come across two different definitions of indistinguishability under (adaptive) chosen ciphertext attack. IND-CCA (https://eprint.iacr.org/2017/604.pdf page 10, Kyber - https://eprint.iacr.org/2017/634.pdf page 3) and IND-CCA2 (https://eprint.iacr.org/2002/174.pdf page 4), the security experiments differ only in that the adversary, in the IND-CCA2 case, has an access to the decapsulation oracle also before he receives the challenge.
IND-CCA (cannot query $c^*$ to Decaps oracle):
$$
(sk, pk) \leftarrow Gen()\\
(c^*, K_0) \leftarrow Encaps(pk), K_0 \in \mathbb{K}\\
K_1 \stackrel{$}{\leftarrow} \mathbb{K} \\
b \stackrel{$}{\leftarrow} \{0,1\}\\
b' \leftarrow A^{Decaps(sk,\cdot)}(c^*, K_b)
$$
IND-CCA2 (can query anything to Decaps oracle before receiving $c^*$, then cannot query $c^*$ to Decaps oracle):
$$
(sk, pk) \leftarrow Gen()\\
s \leftarrow A^{Decaps(sk,\cdot)}()\\
(c^*, K_0) \leftarrow Encaps(pk), K_0 \in \mathbb{K}\\
K_1 \stackrel{$}{\leftarrow} \mathbb{K} \\
b \stackrel{$}{\leftarrow} \{0,1\}\\
b' \leftarrow A^{Decaps(sk,\cdot)}(s, c^*, K_b)
$$
$s$ is supposed to signify some information from the first stage - maybe a list of queries made.
Is there any easy explanation how do these two definitions actually differ? What is the advantage the adversary gets by being able to decrypt ciphertexts before he receives the challenge ciphertext (which he then cannot decrypt using the oracle).
The only advantage I can think of is that if in the first stage the adversary guesses the ciphertext that is going to be generated by the challenger, then he can win the game since he knows (assuming he can save it) that this ciphertext has already been queried and decrypts to key $K$. This probability is of course negligible (assuming the keyspace is large and he is polynomial) and therefore the advantage difference is negligible?
NIST in their call for proposals mentions IND-CCA2 definition (doesn't define it). Kyber in their NIST submission call the KEM IND-CCA2 with a reference to the paper linked above where the definition is IND-CCA. Is this just a naming confusion?
Am I missing something and is there any real difference between these two definitions?