fetch_dominick¶
yohou.datasets._fetchers.fetch_dominick(*, n_series=50, data_home=None, download_if_missing=True, n_retries=3, delay=1.0)
¶
Fetch the Dominick dataset from Monash/Zenodo.
Weekly time series representing the profit of individual stock keeping units from a retailer (Dominick's Finer Foods). The full dataset contains 115 704 series; by default only the first 50 are loaded to keep memory usage reasonable.
Parameters¶
| Name | Type | Description | Default |
|---|---|---|---|
n_series
|
int or None
|
Maximum number of series to include. |
50
|
data_home
|
str, PathLike, or None
|
Specify another download and cache folder for the datasets.
By default all yohou data is stored in |
None
|
download_if_missing
|
bool
|
If |
True
|
n_retries
|
int
|
Number of retries when HTTP errors are encountered. |
3
|
delay
|
float
|
Number of seconds between retries. |
1.0
|
Returns¶
| Type | Description |
|---|---|
Bunch
|
Dictionary-like object with the following attributes: frame : pl.DataFrame
DataFrame with |
See Also¶
fetch_tourism_monthly: Monthly tourism series.fetch_hospital: Monthly hospital patient count series.get_data_home: Return the path of the data directory.
References¶
[1] Godahewa, R., Bergmeir, C., Webb, G. I., Hyndman, R. J., & Montero-Manso, P. (2021). "Monash Time Series Forecasting Archive." Neural Information Processing Systems Track on Datasets and Benchmarks. https://doi.org/10.5281/zenodo.4654802
Examples¶
>>> from yohou.datasets import fetch_dominick
>>> bunch = fetch_dominick()
>>> bunch.frame.columns[:2]
['time', 'T1__profit']
Source Code¶
Show/Hide source
550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 | |