Break down the steps of training a model and it quickly becomes apparent why it's technically wrong to call this a copyright infringement. First, the act of making transient copies of works - even billions of works - is unequivocally fair use. Unless you think search engines and the @internetarchive shouldn't exist, then you should support scraping at scale: