You can get resources from :
- ETL (Extract - transfert - Load) with as example Talend
- query tools (Elasticsearch ...)
- and build-in tools (R-ODBC, RJSON with R, pandas and pyodbc ... with python, pyspark ...)
with hadoop, you can use Scoop and flume. Check Apache fondation site ...
if you are using a Linux system, it is easy to check available products with apt-cache search (debian...), yum or dnf (redhat ...) You have to be an administrator.
good luck !