google-generativeai streamlit scikit-learn nltk PyPDF2 transformers