add a glance to library index by caiocmello · Pull Request #59 · impresso/impresso-py

caiocmello · 2026-04-20T12:06:44Z

I've added a section, 'a glance', to the Python library index file. It appears here (https://impresso.readthedocs.io/en/latest/). Could you please review it and let me know if any information is missing?

mduering · 2026-04-20T12:39:41Z

Small tweak: "The Impresso Python library designed to.."

simon-clematide · 2026-04-20T12:54:50Z

Sorry to barge in. To me this looks more like a "Quickstart" than a "Glance", which is typically more conceptual (and which we should provide as well). I would not use the word "page" here, since in the context of newspapers "page" has another dominant meaning.

I would make it even more "quick and condensed".
(Beware of markdown-in-markdown artefacts below.)

## Quickstart

### Create a session

```python
from impresso import connect

client = connect()
```

### Search the archive

```python
results = client.search.find(term="moon landing")
results
```

To view the full DataFrame:

```python
results.df
```

### Retrieve results in batches

Search results are returned in batches. By default, only the first batch is displayed. Use `limit` and `offset` to retrieve additional results.

```python
import pandas as pd

total_results = 2000
limit = 1000
all_results = []

for offset in range(0, total_results, limit):
    results = client.search.find(
        term="Titanic",
        order_by="-date",
        limit=limit,
        offset=offset,
    )
    all_results.append(results.df)

full_results_df = pd.concat(all_results, ignore_index=True)
full_results_df
```

### Get a content item by ID

```python
item = client.content_items.get("NZG-1877-10-20-a-i0024")
item
```

Transcript text is available in `text.content`.

### Open a content item in the web app

https://impresso-project.ch/app/article/{id}
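
The URL template above can also be filled in from code. This is only a minimal sketch, assuming the `{id}` placeholder takes the content item ID unchanged; `web_app_url` is a hypothetical helper written for illustration and is not part of the impresso library.

```python
from urllib.parse import quote

# URL template quoted from the quickstart draft above.
WEB_APP_ARTICLE_URL = "https://impresso-project.ch/app/article/{id}"


def web_app_url(content_item_id: str) -> str:
    """Fill the {id} placeholder with a URL-escaped content item ID.

    Hypothetical helper: it only formats a string and makes no request.
    """
    return WEB_APP_ARTICLE_URL.format(id=quote(content_item_id, safe=""))


print(web_app_url("NZG-1877-10-20-a-i0024"))
```

Escaping the ID is a defensive choice; the example ID contains only characters that pass through unchanged.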

caiocmello · 2026-04-20T13:17:50Z

Hi @simon-clematide, thanks very much for your feedback. Your suggestion looks great. I would just avoid the subtitle 'Get a content item by ID', as it means nothing to a new user: 'content item' is not defined here. The whole point of this part is to make it very clear, from the beginning, where the transcripts are hidden. For the rest, the more concise version reads well. Thank you!

Regarding 'batches', I understand the point of avoiding the word 'page'. But the documentation uses the term 'pagination' here (https://impresso.readthedocs.io/en/latest/result/#pagination-information). Would you advise changing the word 'pagination' throughout the entire Python library documentation?

simon-clematide · 2026-04-20T13:28:40Z

@caiocmello I would not change "pagination" to "batches" in the technical documentation, but just avoid the bare word "page" (as I think you already did now). Maybe we can call it "pagination batches" in the quickstart. This then connects to the more technical term.
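
To make the link between "pagination batches" and the `limit`/`offset` parameters concrete, here is a small sketch that only computes the request windows and makes no API calls. `batch_windows` is a hypothetical name introduced for illustration, and the totals are arbitrary.

```python
def batch_windows(total: int, limit: int) -> list[tuple[int, int]]:
    """Return (offset, batch_size) pairs that cover `total` results.

    Hypothetical helper: each pair corresponds to one request made
    with that offset; the last batch may be smaller than `limit`.
    """
    if limit <= 0:
        raise ValueError("limit must be positive")
    return [
        (offset, min(limit, total - offset))
        for offset in range(0, total, limit)
    ]


print(batch_windows(2000, 1000))  # [(0, 1000), (1000, 1000)]
print(batch_windows(2500, 1000))  # [(0, 1000), (1000, 1000), (2000, 500)]
```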

caiocmello · 2026-04-21T14:46:09Z

Comment from Roman:

Update pagination code with better option:

```python
import pandas as pd

# `impresso` is assumed to be the client object returned by connect()
# Get first page with 50 items per page
results = impresso.search.find(term="revolution", limit=50)
df = results.df

# Iterate through all pages
for page in results.pages():
    print(f"Processing page at offset {page.offset}")
    print(f"Contains {page.size} items")
    df = pd.concat([df, page.df])
```

Add a 'warning banner' informing users of the monthly limit of 200,000 (double-check the limit), e.g. advising them to be careful when using concat in a loop over all pages.
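
One way such a warning could be paired with code is a guard that stops paging once a self-imposed item budget is reached. The sketch below is an assumption-laden illustration, not library behaviour: `FakePage` stands in for the page objects in the snippet above (only the `offset` and `size` attributes are taken from it), `take_within_budget` is a hypothetical helper, and the budget value is arbitrary. How the actual monthly limit is counted still needs to be checked.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass
class FakePage:
    """Stand-in for a result page; only offset and size are modelled."""
    offset: int
    size: int


def take_within_budget(pages: Iterable[FakePage], max_items: int) -> Iterator[FakePage]:
    """Yield pages until yielding another one would exceed max_items."""
    used = 0
    for page in pages:
        if used + page.size > max_items:
            break  # stop consuming the iterable once the budget is reached
        used += page.size
        yield page


# Ten fake pages of 50 items each, with an arbitrary budget of 120 items.
pages = [FakePage(offset=o, size=50) for o in range(0, 500, 50)]
kept = list(take_within_budget(pages, max_items=120))
print([p.offset for p in kept])  # [0, 50]
```

Because the loop breaks before pulling further pages, a lazily evaluated page iterator would not be advanced past the budget; whether that also avoids further requests depends on how the library fetches pages.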

add a glance to library index

4bdedeb

caiocmello requested review from caovy-univers, e-maud and theorm April 20, 2026 12:06

caiocmello assigned danieleguido and caiocmello Apr 20, 2026

theorm added 2 commits April 22, 2026 09:22

updated changes

d6670d3

added links to necessary resources in documentation

a3b9006

theorm approved these changes Apr 22, 2026

brought back the deleted connect section

3011008

theorm merged commit 192f907 into main Apr 22, 2026
2 checks passed

theorm deleted the indexupdate-aglance branch April 22, 2026 07:35

theorm merged 4 commits into main from indexupdate-aglance

5 participants
