I think your comment mostly makes sense, except the part about neural network guys needing to familiarize themselves with Chomsky, which is not the case at all.
Ergo my initial claim that "modern approaches have zero overlap with Chomsky's deterministic methodology." Statistical token prediction began with the Dragon folks, the CMU guys, and the Yorktown Heights group, many of whom encountered Chomsky's formalism as undergrads.
german_dong|2 months ago