Partially Shuffling the Training Data to Improve Language Models - Citation Graph | Papersgraph