Semi-supervised learning with unsupervised pretraining on unlabeled sequence data and supervised finetuning on labeled sequence data