It may be that we have a model where the following equation holds for some phenomenon:
$$(1)\quad x + y + z = T$$
Importantly, $T$ is a constant, i.e.: $$(2) \quad \frac{\mathrm{d}T(t)}{\mathrm{d}t} = 0$$
We may be interested in the evolution of $x$, $y$ and $z$ as governed by their differential equations. If we want to make sure that we maintain condition $(2)$ (not guaranteed for any numerical integration scheme), then we may represent the problem as follows:
$$ (3a)\quad \frac{\mathrm{d}x}{\mathrm{d}t} = f(x, y, z, t)$$
$$ (3b)\quad \frac{\mathrm{d}y}{\mathrm{d}t} = g(x, y, z, t)$$
$$ (3c)\quad \frac{\mathrm{d}z}{\mathrm{d}t} = \frac{\mathrm{d}T(t)}{\mathrm{d}t}- f(x,y,z,t) - g(x, y, z, t) = - f(x,y,z,t) - g(x, y, z, t)$$
So, some program might calculate the evolution of $x$, $y$ and $z$ by numerical integrating $x$ and $y$, while using condition $(2)$ to determine $z$ (generally, abd during the calculations for $x$ and $y$).
One takeaway I have had from a numerical analysis course I was a part of recently is that "if you're not violating some conservation rule in the phenomenon being modelled, then you have the exact solution" (and numerical solutions are not exact solutions, usually). Thus, even though I might feel nice about saving $(2)$, I know I am losing something somewhere else -- what is it that I am losing?
An idea I have have is that $(3)$ essentially allows a numerical integration scheme to assume that any difference in $T$ is solely made up by $z$, rather than by some combination of $x$, $y$ and $z$? Or, put differently, any error in the numerical integration scheme in violating $(2)$ is chalked up to $z$, so we may not be getting the actual dynamics of $z$, but rather, the dynamics of $z$ along with some error based on how $(2)$ is being violated by the numerical scheme?
How might one more precisely explain what the downside of using a DAE system is in terms of "what is lost in approximation"?