Background
Functions in $AC^0$ are PAC learnable in quasipolynomial time with a classical algorithm that requires $O(2^{log(n)^{O(d)}})$ randomly chosen queries to learn a circuit of depth d [1]. If there is no $2^{n^{o(1)}}$ factoring algorithm then this this is optimal [2]. Of course, on a quantum computer we know how to factor, so this lower bound does not help. Further, the optimal classical algorithm uses the Fourier spectrum of the function thus screaming "quantumize me!"
[1] N. Linial, Y. Mansour, and N. Nisan. [1993] "Constant depth circuits, Fourier transform, and learnability", Journal of the ACM 40(3):607-620.
[2] M. Kharitonov. [1993] "Cryptographic hardness of distribution-specific learning", Proceedings of ACM STOC'93, pp. 372-381.
In fact, 6 years ago, Scott Aaronson put the learnability of $AC^0$ as one of his Ten Semi-Grand Challanges for Quantum Computing Theory.
Question
My question is three-fold:
1) Are there examples of natural function families that quantum computers can learn faster than classical computers given cryptographic assumptions?
2) Has there been any progress on the learnability of $AC^0$ in particular? (or the slightly more ambitious $TC^0$)
3) In regards to the learnability of $TC^0$, Aaronson comments: "then quantum computers would have an enormous advantage over classical computers in learning close-to-optimal weights for neural networks." Can somebody provide a reference on how weight updating for neural nets and $TC^0$ circuits relate? (apart from the fact that threshold gates look like sigmoid neurons) (This question was asked and answered already)