Exploiting Machine Learning For Improving In-Memory Execution of Data-Intensive Workflows on Parallel Machines