Small tweak: "The Impresso Python library designed to.."
Sorry to barge in. To me this looks more like a "Quickstart" than a "Glance", which is typically more conceptual (and which we should provide as well). I would not use the word "page" here, since in the context of newspapers "page" has another dominant meaning. I would make it even more "quick and condensed":

Search the archive

    results = client.search.find(term="moon landing")
    results

To view the full DataFrame: results.df

Retrieve results in batches

Search results are returned in batches. By default, only the first batch is displayed. Use

    import pandas as pd

    total_results = 2000
    limit = 1000
    all_results = []
    # Request one batch per offset and collect the DataFrames.
    for offset in range(0, total_results, limit):
        results = client.search.find(
            term="Titanic",
            order_by="-date",
            limit=limit,
            offset=offset,
        )
        all_results.append(results.df)
    # Combine all batches into a single DataFrame.
    full_results_df = pd.concat(all_results, ignore_index=True)
    full_results_df

Get a content item by ID

    item = client.content_items.get("NZG-1877-10-20-a-i0024")
    item

Transcript text is available in

Open a content item in the web app
Hi @simon-clematide, thanks very much for your feedback. Your suggestion looks great. I would just avoid the subtitle 'Get a content item by ID', as it says nothing to a new user: 'content item' is not defined here. The whole point of this part is to make it very clear, from the beginning, where the transcripts are hidden. For the rest, it reads well in the more concise version. Thank you! Regarding 'batches', I understand the point of avoiding the word 'page'. But the documentation uses the term 'pagination' here (https://impresso.readthedocs.io/en/latest/result/#pagination-information). Would you advise that we change the word 'pagination' throughout the entire Python library documentation?
@caiocmello I would not change 'pagination' to 'batches' in the technical documentation, but just avoid the bare word "page" (as I think you have already done now). Maybe we can call it "pagination batches" in the quickstart. This then connects it to the more technical term.
Comment from Roman: Update the pagination code with a better option. Add a 'warning banner' informing users of the monthly limit of 200,000 (double-check limit), e.g. be careful when using concat...
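One possible shape for the "better option", as a sketch only: rather than hard-coding `total_results`, the loop could stop as soon as a batch comes back short and never request more than an explicit cap. Everything below is hypothetical and not part of the Impresso library: `iter_batches`, `fetch_batch`, `max_items`, and the `FAKE_ARCHIVE` stand-in exist only so the snippet runs without a client. In the quickstart, `fetch_batch` would presumably wrap the `client.search.find(..., limit=..., offset=...)` call from the proposal in this thread, and how retrieved items count against the monthly limit still needs to be double-checked.

```python
from typing import Callable, Iterator, List


def iter_batches(
    fetch_batch: Callable[[int, int], List[dict]],
    limit: int = 100,
    max_items: int = 1000,
) -> Iterator[List[dict]]:
    """Yield pagination batches until results run out or max_items is reached."""
    fetched = 0
    while fetched < max_items:
        # Never request more than the remaining budget.
        size = min(limit, max_items - fetched)
        batch = fetch_batch(fetched, size)
        if not batch:
            break  # nothing left to retrieve
        yield batch
        fetched += len(batch)
        if len(batch) < size:
            break  # a short batch means this was the last one


# Hypothetical stand-in for the real search call: 250 fake items.
FAKE_ARCHIVE = [{"uid": f"item-{i}"} for i in range(250)]


def fake_fetch(offset: int, limit: int) -> List[dict]:
    """Return one slice of the fake archive, like an offset/limit query."""
    return FAKE_ARCHIVE[offset : offset + limit]


# Without a tight cap, the loop stops on its own after the short last batch.
all_items = [
    item
    for batch in iter_batches(fake_fetch, limit=100, max_items=1000)
    for item in batch
]

# With a cap of 150, the second request is trimmed to 50 items.
capped = [
    item
    for batch in iter_batches(fake_fetch, limit=100, max_items=150)
    for item in batch
]

print(len(all_items), len(capped))
```

The explicit `max_items` cap is the design choice that matters for the warning banner: the user has to state up front how much they are willing to retrieve, instead of discovering it after a large concat.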
I've added a section 'a glance' to the Python library index file. It appears here (https://impresso.readthedocs.io/en/latest/). Could you please review it and let me know if any information is missing?