feature -- adding `worksheet.get_records` to get specific row ranges #1301

AndrewBasem1 · 2023-09-21T23:11:22Z

Closes #1294

Changes

added get_records_subset to provide similar behavior to get_all_records but allows the user to chose a first_row and last_row to get the data from only.
changed get_all_records to simply call get_records_subset
added _validate_rows_ranges_for_get_records_subset to validate the user inputs for (head, first_row, last_row)
added _validate_headers_and_keys_for_get_records_subset to do the following:
1. validate expected_headers given by the user are unique
2. validate that the obtained keys contains the expected_headers
3. validate that the obtained keys are unique
added _pad_values_and_keys_for_get_records_subset to match the size of the keys and rows, to produce dictionaries as expected.
added tests for the new functions

alifeee

Thanks for this PR! You have put a lot of work into it! :)

Some questions from me:

Why update all the cassettes? You should only need to update the ones for the tests you modify/add
The new function should be named get_records, as this is simpler
The first_row and last_row (any maybe head?) should be required arguments
We do not need the methods _validate_rows_ranges_for_get_records_subset _validate_headers_and_keys_for_get_records_subet _pad_values_and_keys_for_get_records_subset. This logic can go in the function get_records

Some thoughts:

instead of "first_row" and "last_row", we could instead ask for a range (user would enter "4:9", for example. This would then make it easier to translate the function to be column-based instead of row-based (see getallrecords() – can there be a "rows" option to get attributes from rows, not columns? #808)

Thanks again for this work :)

gspread/worksheet.py

This reverts commit 14ed342.

This reverts commit 307fa22.

AndrewBasem1 · 2023-09-24T06:06:27Z

I've made a few changes, please check and let me know what you think about the current state.

Also, I'm not sure how replacing first and last_row with a range would help implement #808. I'd actually argue for renaming first_row to first_index and adding a use_index argument (this will be consistent with pandas, which I think a big part of the pythno community already uses)

gspread/worksheet.py

tests/worksheet_test.py

alifeee · 2023-09-25T12:14:05Z

I've made a few changes, please check and let me know what you think about the current state.

Thanks! Your code changes are very clear, and your tests are layed out very clearly. It is good.

Also, I'm not sure how replacing first and last_row with a range would help implement #808. I'd actually argue for renaming first_row to first_index and adding a use_index argument (this will be consistent with pandas, which I think a big part of the pythno community already uses)

This sounds good. My original thinking was that a user could select a range with "4:8" or "D:G", as one could use as indices in get_values. Then they would use a kwarg to select between indexing rows or columns first. Your solution is good to me also.

I will enable the workflow now. You may need to run the formatter with tox -e format

alifeee · 2023-09-25T12:24:34Z

Cassettes are broken. I have fixed them in https://github.com/burnash/gspread/tree/feature/get_records_limit

I could not push to https://github.com/AndrewBasem1/gspread/tree/feature/get_records_limit

Moved the `first_row` and `last_row` validations inside the method, as well as the `expected_headers` and `keys` validations.

…o feature/get_records_limit

tests/worksheet_test.py

alifeee · 2023-09-25T23:26:23Z

[...] I'd actually argue for renaming first_row to first_index and adding a use_index argument [...]

Once this is done, and the CI passes, I think this is ready to merge :)

alifeee · 2023-09-26T10:09:41Z

You will want to delete the cassette for test_get_records, and re-run the test online. Then, it looks like all should be good.

AndrewBasem1 · 2023-09-27T07:24:12Z

@alifeee you'll have to excuse my lack of expertise, it's my first time working with cassettes so I don't have a full understanding of them. hope this fixed the issue. Please check and let me know if there is anything else I can do

alifeee · 2023-09-28T11:34:21Z

Thanks again for this PR!! You have been clear, helpful, and thoughtful about your changes. You write Python well.

I am happy to merge this in.

It will be usable in the next release of gspread, which should be v5.12.0 in a couple of weeks (see milestone)

AndrewBasem1 added 8 commits September 15, 2023 00:00

adding skeleton for method

9b7c207

renaming args, and adding some sanity checks

5d5dd0c

adding main logic

f6f0658

fixing some minor issues

bd06c33

updated doctsring, added tests, and fixed issues

a356c91

rename some tests

96b74a2

adding cassettes

307fa22

Merge branch 'master' into feature/get_records_limit

5c4ca7b

AndrewBasem1 changed the title ~~Feature/get_records_limit~~ feature -- adding worksheet.get_records_subset to get specific row ranges Sep 21, 2023

AndrewBasem1 mentioned this pull request Sep 21, 2023

getallrecords() – can there be a "rows" option to get attributes from rows, not columns? #808

Open

alifeee requested changes Sep 22, 2023

View reviewed changes

AndrewBasem1 added 10 commits September 24, 2023 06:57

removing isinstance checks

99c8fa1

removing unneeded variable

4565154

making rows args mandatory

14ed342

improving padding for get_records

27eb373

renaming method to get_records

301666e

fixing broken test functions

0f668f5

Revert "making rows args mandatory"

652c497

This reverts commit 14ed342.

adding kwargs in get_all_records

12a2794

Revert "adding cassettes"

415861f

This reverts commit 307fa22.

adding needed cassettes only

a574f6b

alifeee requested changes Sep 25, 2023

View reviewed changes

gspread/worksheet.py Outdated Show resolved Hide resolved

gspread/worksheet.py Outdated Show resolved Hide resolved

tests/worksheet_test.py Outdated Show resolved Hide resolved

fix cassettes

fe43232

AndrewBasem1 added 4 commits September 25, 2023 23:37

moving validations inside the method

63d57c0

Moved the `first_row` and `last_row` validations inside the method, as well as the `expected_headers` and `keys` validations.

moving padding inside

29955c1

adding test for fill_gaps with defined value

e31ed99

adding a default value in fill gaps

aea25a8

AndrewBasem1 added 3 commits September 26, 2023 00:03

using fill gaps in method

c690971

Merge remote-tracking branch 'upstream/feature/get_records_limit' int…

0a9aa8c

…o feature/get_records_limit

aligining test_cases with comments, and adding new one

342f35f

AndrewBasem1 requested a review from alifeee September 25, 2023 21:29

ignoring function complexity checker

81c1d4a

alifeee requested changes Sep 25, 2023

View reviewed changes

tests/worksheet_test.py Show resolved Hide resolved

renaming args, and adding examples in docstring

b0ea1be

AndrewBasem1 requested a review from alifeee September 26, 2023 06:50

alifeee approved these changes Sep 26, 2023

View reviewed changes

adding new cassette

4f39bef

alifeee changed the title ~~feature -- adding worksheet.get_records_subset to get specific row ranges~~ feature -- adding worksheet.get_records to get specific row ranges Sep 28, 2023

alifeee assigned AndrewBasem1 Sep 28, 2023

alifeee added this to the 5.12 milestone Sep 28, 2023

alifeee added Feature Request Improvement labels Sep 28, 2023

alifeee merged commit 7fe63bf into burnash:master Sep 28, 2023

alifeee mentioned this pull request Oct 25, 2023

Remove unused parameter or use it #1332

Closed

alifeee mentioned this pull request Nov 5, 2023

remove use_index and references to it in get_records #1343

Merged

alifeee mentioned this pull request Nov 15, 2023

get_all_records with unique expected_headers fails in version 5.12 #1352

Closed

alifeee mentioned this pull request Dec 7, 2023

PROPOSAL: changes to get_records/get_all_records #1367

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature -- adding `worksheet.get_records` to get specific row ranges #1301

feature -- adding `worksheet.get_records` to get specific row ranges #1301

AndrewBasem1 commented Sep 21, 2023 •

edited

Loading

alifeee left a comment

AndrewBasem1 commented Sep 24, 2023

alifeee commented Sep 25, 2023

alifeee commented Sep 25, 2023

alifeee commented Sep 25, 2023

alifeee commented Sep 26, 2023

AndrewBasem1 commented Sep 27, 2023

alifeee commented Sep 28, 2023

feature -- adding worksheet.get_records to get specific row ranges #1301

feature -- adding worksheet.get_records to get specific row ranges #1301

Conversation

AndrewBasem1 commented Sep 21, 2023 • edited Loading

Changes

alifeee left a comment

Choose a reason for hiding this comment

AndrewBasem1 commented Sep 24, 2023

alifeee commented Sep 25, 2023

alifeee commented Sep 25, 2023

alifeee commented Sep 25, 2023

alifeee commented Sep 26, 2023

AndrewBasem1 commented Sep 27, 2023

alifeee commented Sep 28, 2023

feature -- adding `worksheet.get_records` to get specific row ranges #1301

feature -- adding `worksheet.get_records` to get specific row ranges #1301

AndrewBasem1 commented Sep 21, 2023 •

edited

Loading