1

I wonder how can I share the value of variables between HTTP requests in FastAPI. For instance, I have a POST request in which I get some audio files and then I convert their info into a Pandas Dataframe. I would like to send that Dataframe in a GET request, but I can't access the Dataframe on the GET request scope.

@app.post(
    path="/upload-audios/",
    status_code=status.HTTP_200_OK
)
async def upload_audios(audios: list[UploadFile] = File(...)):
    filenames = [audio.filename for audio in audios]
    audio_data = [audio.file for audio in audios]
    new_data = []
    final_data = []
    header = ["name", "file"]
    for i in range(len(audios)):
        new_data = [filenames[i], audio_data[i]]
        final_data.append(new_data)
    new_df = pd.DataFrame(final_data, columns=header)
    return f"You have uploaded {len(audios)} audios which names are: {filenames}"

@app.get("/get-dataframe/")
async def get_dataframe():
    pass
Chris
  • 4,940
  • 2
  • 7
  • 28
nyibf_
  • 304
  • 2
  • 10
  • Store the requested data in a storage solution - like redis, sqlite, on disk, rdbms - wherever, then read it and create the dataframes when the user requests them. You'll also need to return them in a format that FastAPI can serialize properly. – MatsLindh Feb 25 '22 at 11:35
  • @MatsLindh, so I need a database. But if I dont want to use a memory mechanism, the same thing could be done with python context variables? – nyibf_ Feb 25 '22 at 19:18
  • 2
    You could store it in-memory in your process - as long as you never expect to serve more than one user, and don't plan on having multiple workers active at the same time (which would have their own memory, so the worker handling the get would not necessarily be the same as the one handling the post). Do keep the data in-process, declare a dictionary outside of the functions, then assign to a key inside the dictionary inside the function - `foo = {}` at the top, then `foo['pd'] = ..` inside your functions. – MatsLindh Feb 25 '22 at 22:57

1 Answers1

0

If you need read-only access to that variable, and/or you never expect it to be changed by some other request before reading it (in other words, you never expect to serve more than one client), as well as your app does not use several workers at the same time (where each worker has its own memory), you could either (as mentioned by @MatsLindh in the comments) declare a dictionary foo = {} outside the endpoints and assign a key to it inside the endpoint foo['pd'] = new_df (which you can later retrieve), or declare your variable as global (as described here), or, preferably, store it on the app instance. For example:

app.state.new_df = new_df 

Inside get-dataframe endpoint retrieve the new_df as:

new_df = app.state.new_df

or, if the app instance is not available in the file from which you are working (let's say you have your endpoints defined in submodules, separately from the main module, as described here), you could get the app instance from the Request object:

from fastAPI import Request
@app.get("/get-dataframe/")
async def get_dataframe(request: Request):
    return request.app.state.new_df

Otherwise, if you need that variable/object to be shared among different clients, as well as among multiple processes/workers, that may also require read/write access to it, you should rather use a database storage, such as PostgreSQL, SQLite, MongoDB, etc., or Key-Value stores (Caches), such as Redis, Memcached, etc. You may want to have a look at this answer as well.

Chris
  • 4,940
  • 2
  • 7
  • 28