It's a non-trivial amount of bandwidth that would've been instantly detected by white hats and sent to every tech journalist in the planet if they were actually doing this
Not if the speech processing algorithms are already present in the OS. It would be possible to run regression analysis locally and produce results which measure in the single digit KB amounts or smaller. Even the models themselves are rather tiny after they've been trained. For neural networks its just bunch of floating point values.