Adjust the root paths in train_donut.py to match your setup. On the first run, the data is downloaded and preprocessed automatically, which will take some time.
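
As a rough illustration of what adjusting the root paths might involve, here is a minimal sketch. The names `DATA_ROOT`, `OUTPUT_ROOT`, `prepare_roots`, and `is_first_run` are hypothetical and are not taken from train_donut.py, so check the script for the variables it actually uses. The sketch only shows one way to point the roots at local directories and to tell whether the data directory is still empty, in which case a download would presumably be triggered.

```python
from pathlib import Path

# Hypothetical names and locations: the actual variables in
# train_donut.py may be called something else entirely.
DATA_ROOT = Path.home() / "donut" / "data"
OUTPUT_ROOT = Path.home() / "donut" / "output"


def prepare_roots(*roots: Path) -> None:
    """Create each root directory if it does not exist yet."""
    for root in roots:
        root.mkdir(parents=True, exist_ok=True)


def is_first_run(data_root: Path) -> bool:
    """Treat a missing or empty data directory as a first run."""
    return not data_root.exists() or not any(data_root.iterdir())
```

Calling `prepare_roots(DATA_ROOT, OUTPUT_ROOT)` before training makes sure both directories exist, and `is_first_run(DATA_ROOT)` gives a quick hint as to whether the slow download and preprocessing step is still ahead.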