Has anybody worked on Open Source Apache Hadoop based Data Lake integrated with external Python 3.x version?
We have the Data Lake with default Python 2.7 version . We however, want to use the latest version of Python so trying to find a way to connect the same. Also, is there a way we utilise external Python for Processing but consume memory from Data Lake?
Our Python server isnt big enough to handle multiple jobs, hence this question.