Snippet Wednesday - Improve app performance by running calculations in parallel 🧵

mslootweg · 29 August 2023 17:52

Hi all,

Inspired by the post made by @kvangiessen for last week’s Snippet Wednesday, I decided to share a snippet this week on a case where I had to speed up calculations dramatically to make an app worthwhile. In short, I achieved this by running the calculations in parallel.

The Snippet

Let’s say you have an analysis or calculation that you would like to run. As an example, I will use a simple calculation that evaluates an equation and calls a sleep function to simulate the delay one would typically have with a long-running calculation:

Blocking method

import random
import time

def blocking_method(name: str, x: float, y: float):
    sleep_time = random.randrange(1, 5)
    print(f"Starting analysis {name}, sleeping for {sleep_time}\n")
    time.sleep(sleep_time)
    z = x ** 2 + y ** 2
    print(f"Completed analysis {name} \n")
    return z

If you were to run 100 or so of these calculations one after the other, that would take quite some time to complete. This can be shortened dramatically by running them in parallel. Running all cases at once is usually not feasible due to constraints such as computational resources or the number of licenses (in the case of calculating with licensed third-party software). Therefore, you need to batch the cases to a number that fits the constraints. Here is the snippet for running the cases in batches:

Batch run method

from typing import List
import asyncio
from concurrent.futures import ThreadPoolExecutor


async def non_blocking_batch_run(models_input: List[dict]) -> List[float]:
    """
    :param models_input: [{"name": <name>, "x": <x>, "y": <y>}, ...]
    """
    loop = asyncio.get_running_loop()
    # define the maximum number of operations possible per batch
    with ThreadPoolExecutor(max_workers=3) as pool:
        # The method `blocking_method` runs in parallel within this asynchronous method.
        results = await asyncio.gather(*[loop.run_in_executor(pool, blocking_method, *run.values()) for run in models_input])
    return results

If you are limited by licenses, you can simply set max_workers to the number of licenses available. If you are limited by computational resources, you will have to experiment: increase the number of cases running in parallel until you reach the limit of the resources. Make sure to use representative cases when doing these tests.
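To get a feel for the effect of max_workers before testing with real calculations, something like the sketch below might help. It is only an illustration: `ConcurrencyProbe`, `simulated_case` and `probe_batch` are hypothetical names that are not part of the snippet above, and a fixed sleep stands in for the real calculation. It records how many cases actually ran at the same time and how long the whole batch took.

```python
import asyncio
import threading
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Tuple


class ConcurrencyProbe:
    """Hypothetical helper that records how many calls run at the same time."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self._active = 0
        self.peak = 0

    def simulated_case(self, duration: float) -> None:
        with self._lock:
            self._active += 1
            self.peak = max(self.peak, self._active)
        time.sleep(duration)  # stands in for the real (licensed or heavy) calculation
        with self._lock:
            self._active -= 1


async def probe_batch(max_workers: int, n_cases: int, duration: float) -> Tuple[float, int]:
    """Run n_cases simulated cases and return (elapsed seconds, peak concurrency)."""
    probe = ConcurrencyProbe()
    loop = asyncio.get_running_loop()
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        await asyncio.gather(
            *[loop.run_in_executor(pool, probe.simulated_case, duration) for _ in range(n_cases)]
        )
    return time.perf_counter() - start, probe.peak


# Sweep the number of workers; the peak never exceeds max_workers.
for workers in (1, 2, 3):
    elapsed, peak = asyncio.run(probe_batch(workers, n_cases=6, duration=0.1))
    print(f"max_workers={workers}: {elapsed:.2f} s, peak concurrency {peak}")
```

With real calculations the run time will stop improving (or get worse) once the machine runs out of resources, which is the point where you have found a sensible max_workers.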

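Take note: the snippet unpacks the values of each dictionary as positional arguments. This assumes that the keys of every dictionary are in the same order as the parameters of blocking_method (name, x, y). Dictionaries preserve their insertion order in Python 3.7 and later.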
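If you would rather not depend on the key order, one option is to pass the dictionary as keyword arguments. The sketch below is a variation on the snippet, not the version used in the project: `example_calculation` and `batch_run_by_keyword` are hypothetical names, and the sleep is left out to keep it short. Since run_in_executor only forwards positional arguments, functools.partial is used to bind the keywords.

```python
import asyncio
from concurrent.futures import ThreadPoolExecutor
from functools import partial
from typing import List


def example_calculation(name: str, x: float, y: float) -> float:
    """Hypothetical stand-in for `blocking_method`, without the sleep."""
    return x ** 2 + y ** 2


async def batch_run_by_keyword(models_input: List[dict]) -> List[float]:
    loop = asyncio.get_running_loop()
    with ThreadPoolExecutor(max_workers=3) as pool:
        # run_in_executor only forwards positional arguments, so the keyword
        # arguments are bound with functools.partial first.
        futures = [
            loop.run_in_executor(pool, partial(example_calculation, **run))
            for run in models_input
        ]
        return await asyncio.gather(*futures)


# The keys are deliberately not in the order of the function signature.
shuffled_input = [
    {"y": 3, "name": "Case 1", "x": 1},
    {"x": 2, "y": 1, "name": "Case 2"},
]
print(asyncio.run(batch_run_by_keyword(shuffled_input)))  # prints [10, 5]
```

The trade-off is that the dictionary keys now have to match the parameter names exactly, otherwise the call raises a TypeError.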

To round off all of this, the concurrent code needs to be run using the asyncio.run function. Here is a snippet that shows example input with the run function:

Test batch method

test_models_input = [
    {'name': 'Case 1', 'x': 1, 'y': 1},
    {'name': 'Case 2', 'x': 2, 'y': 1},
    {'name': 'Case 3', 'x': 1, 'y': 3},
]
test_results = asyncio.run(non_blocking_batch_run(test_models_input))

Results

Running the example provided above, I get the following results:

The results above are based on three cases run in one batch with a maximum of 3 workers. This allowed all three cases to run in parallel.

The example shows that the batch run takes almost exactly as long as the slowest single calculation, instead of the sum of all calculations. That is quite exciting, as this provides many opportunities for developers to improve their applications!
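For those who want to check this observation with reproducible numbers, here is a minimal timing sketch. It assumes fixed sleep times instead of random ones, and the names `SLEEP_TIMES`, `simulated_case` and `run_parallel` are made up for this illustration.

```python
import asyncio
import time
from concurrent.futures import ThreadPoolExecutor
from typing import List

SLEEP_TIMES = [0.1, 0.2, 0.3]  # hypothetical case durations in seconds


def simulated_case(duration: float) -> float:
    time.sleep(duration)  # stands in for a long-running calculation
    return duration


async def run_parallel(durations: List[float]) -> List[float]:
    loop = asyncio.get_running_loop()
    with ThreadPoolExecutor(max_workers=3) as pool:
        return await asyncio.gather(
            *[loop.run_in_executor(pool, simulated_case, d) for d in durations]
        )


# Run the cases one after the other.
start = time.perf_counter()
for d in SLEEP_TIMES:
    simulated_case(d)
serial_time = time.perf_counter() - start

# Run the same cases in one batch of three workers.
start = time.perf_counter()
asyncio.run(run_parallel(SLEEP_TIMES))
parallel_time = time.perf_counter() - start

print(f"serial:   {serial_time:.2f} s (roughly the sum of all cases)")
print(f"parallel: {parallel_time:.2f} s (roughly the slowest case, {max(SLEEP_TIMES)} s)")
```

The exact numbers depend on the machine, but the serial run should land near the sum of the sleep times and the parallel run near the longest one.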

Project Description

Sometimes community members are curious about the type of projects in which we applied these snippets. Here I’ll give a short description of the project I had to optimize:

The project was an extension of an earlier development in which sheetpiles could be designed parametrically, making use of integrated third-party software. This was already valuable for the engineers, but they wanted to take the next step: optimizing sheetpile designs. For this they wanted to calculate many possible sheetpile designs and, based on the results, find a suitable optimal solution. The big problem was that such an optimization took quite some time. But what if one were able to run many different scenarios in parallel? That is where I introduced this Snippet Wednesday snippet to the project!

There have been other projects that also applied this concept, although it may have differed, depending on the Python version and project requirements. It would be great to see some other examples of projects posted in this topic’s chat.
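To give an idea of what such a variation might look like: the sketch below drops asyncio and uses only ThreadPoolExecutor.map from the standard library. This is not necessarily how any of those projects did it; `example_calculation` and `threaded_batch_run` are hypothetical names, and the sleep is left out for brevity.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List


def example_calculation(name: str, x: float, y: float) -> float:
    """Hypothetical stand-in for the blocking calculation."""
    return x ** 2 + y ** 2


def threaded_batch_run(models_input: List[dict], max_workers: int = 3) -> List[float]:
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns the results in the order of the inputs.
        return list(pool.map(lambda run: example_calculation(**run), models_input))


cases = [
    {"name": "Case 1", "x": 1, "y": 1},
    {"name": "Case 2", "x": 2, "y": 1},
]
print(threaded_batch_run(cases))  # prints [2, 5]
```

This version can be called from regular synchronous code, which may be convenient when the rest of the app does not use asyncio.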

Credits and Sources

This is a snippet that was initiated back in the Jurassic era by my colleague @rdejonge (just joking, it was only 4 years ago). Therefore, credit where credit’s due.

Also, this snippet by no means explains the technicalities of multithreading. That was not the intention either, as I merely wanted to convey the concept in a way that can be applied in little time. I hope I have succeeded in that. For those who are interested in a deeper dive🤿, here are some links (although a simple Google search would give you good results as well):

mslootweg · 29 August 2023 17:59

I’ve also converted the code into an app! So for those who are interested in playing around with it, simply copy and paste this code in your app.py file, and then run the application!

import time
import random
from typing import List
import asyncio
from concurrent.futures import ThreadPoolExecutor

import plotly.graph_objects as go

from viktor import ViktorController
from viktor.parametrization import ViktorParametrization, NumberField, DynamicArray, TextField, Text
from viktor.views import PlotlyView, PlotlyResult


async def non_blocking_batch_run(models_input: List[dict]) -> List[dict]:
    """
    :param models_input: [{"name": <name>, "x": <x>, "y": <y>}, ...]
    """
    loop = asyncio.get_running_loop()
    # define the maximum number of operations possible per batch
    with ThreadPoolExecutor(max_workers=3) as pool:
        # The method `blocking_method` runs in parallel within this asynchronous method.
        results = await asyncio.gather(*[loop.run_in_executor(pool, blocking_method, *run.values()) for run in models_input])
    return results


def blocking_method(name: str, x: float, y: float) -> dict:
    sleep_time = random.randrange(1, 5)
    print(f"Starting analysis {name}, sleeping for {sleep_time}\n")
    time.sleep(sleep_time)
    z = x ** 2 + y ** 2
    print(f"Completed analysis {name} \n")
    return {'name': name, 'value': z, 'time': sleep_time}


DEFAULT_ARRAY = [
    {'name': 'Case 1', 'x': 1, 'y': 1},
    {'name': 'Case 2', 'x': 2, 'y': 1},
    {'name': 'Case 3', 'x': 1, 'y': 3},
]


class Parametrization(ViktorParametrization):
    intro = Text(""" # Welcome to the Multithreading investigation app! 🧵

This app provides the user the possibility to investigate how multithreading can be used to improve one's app's performance.

Start by creating a few cases. A row represents a case.
""")
    array = DynamicArray('Cases to run', default=DEFAULT_ARRAY, row_label='Case')
    array.name = TextField('Name')
    array.x = NumberField('x')
    array.y = NumberField('y')


class Controller(ViktorController):
    label = 'Multithread investigation'
    parametrization = Parametrization

    @staticmethod
    def optimize_scenarios(params):
        results = asyncio.run(non_blocking_batch_run(params.array))
        return results

    @PlotlyView("Multithreading investigation", duration_guess=10)
    def get_multithreading_results_view(self, params, **kwargs):
        t = time.time()
        results = self.optimize_scenarios(params)
        total = time.time() - t
        x = [result['name'] for result in results] + ['total (parallel)', 'total (series)']
        y = [result['time'] for result in results] + [total, sum([result['time'] for result in results])]
        colors = ['#1E90FF' for _ in results] + ['#FFCC33', '#14142B']
        fig = go.Figure(
            data=[go.Bar(x=x, y=y, marker_color=colors)],
            layout=go.Layout(title=go.layout.Title(text="An investigation of multithreading visualized"))
        )
        return PlotlyResult(fig.to_json())

mweehuizen · 30 August 2023 07:50

Wow, very nice Marcel. Thanks for sharing.

matthijs · 30 August 2023 08:19

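Very nice Marcel. Cool to see that this approach lets the workers run truly in parallel while they wait on a sleep or an external calculation, since the GIL is released during that time. CPU-bound pure-Python code in a single process would normally not run in parallel because of the GIL.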

Some additional background info for those interested:

Threading is just one approach to concurrency/parallelism in Python. This article lists the possibilities and considerations: Speed Up Your Python Program With Concurrency – Real Python
Python core developers are considering making the GIL optional, which would facilitate multi-core parallelism within a single process: PEP 703 – Making the Global Interpreter Lock Optional in CPython | peps.python.org

Daniel · 4 September 2023 15:07

Super cool! Thanks for the tips!!!