Kerberos support¶
Most of Hadoop instances set up with Kerberos support, so some connections require additional setup to work properly.
HDFSUses requests-kerberos and GSSApi for authentication. It also useskinitexecutable to generate Kerberos ticket.HiveandSparkHDFSrequire Kerberos ticket to exist before creating Spark session.
So you need to install OS packages with:
krb5libs- Headers for
krb5 gccor other compiler for C sources
The exact installation instruction depends on your OS, here are some examples:
apt install libkrb5-dev krb5-user gcc # Debian-based
dnf install krb5-devel krb5-libs krb5-workstation gcc # CentOS, OracleLinux
Also you should pass kerberos to extras to install required Python packages:
pip install "onetl[kerberos]"