Lindel is a Logistic regression model for accurate indel prediction induced by Cas9 cleavage. It takes 65 bp sequence (clevage site at 30) as an input and predicts the frequencies for all possible deletions <30 bp, all 1-2 bp insertions, and insertions larger than 2 bp as a group.
The scheme below showes the structure of Lindel modeling. It takes the sequence and extracting Sequence features (one-hot encoded 20bp target sequence) and Microhomology features (all possible 1-4 bp microhomology tracts from the input sequencee). The model will use these features to make three predictions:
Predicted insertion and deletion frequencies will be weighted by the overall insertion and deletion ratio predicted in part 1.
For more information, please check out our paper(Wei Chen, et al.)