This repository has been archived by the owner on Mar 1, 2024. It is now read-only.

Adding Earning Call transcripts of US based companies #658

Merged (36 commits) on Nov 28, 2023

Athe-kunal
Contributor

Description

This loader helps users get the earning call transcripts of different US-based companies. It also adds a reviews folder argument to the IMDB movie reviews loader.

Fixes # (issue)

Type of Change

Please delete options that are not relevant.

  • New Loader/Tool
  • Bug fix / Smaller change

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration.

  • I stared at the code and made sure it makes sense
  • I tested the code with multiple tickers

Suggested Checklist:

  • I have added a library.json file if a new loader/tool was added
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • I ran make format; make lint to appease the lint gods

@Athe-kunal
Contributor Author

It is complaining about the library.json addition, which I have added in the recent commit.

Collaborator

@jerryjliu jerryjliu left a comment

this is a bit of a nit, but it's technically "earnings call" and not "earning call". can we rename the class names/variables to reflect this?

```python
from llama_index import download_loader

IMDBReviewsloader = download_loader('EarningCallTranscripts')
```
Collaborator

nit: wrong name

Contributor Author

Hi @jerryjliu, thanks for your input.
Yes, I have made the nit changes and updated the README too.

@@ -225,7 +226,7 @@ def main_scraper(

     print(f"Number of reviews scraped: {len(reviews_date)}")
     if generate_csv:
-        os.makedirs("movie_reviews", exist_ok=True)
+        os.makedirs(reviews_folder, exist_ok=True)
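
For context, the hunk above appears to swap a hard-coded output directory for a caller-supplied one. A minimal sketch of that pattern follows, assuming a hypothetical helper `save_reviews_csv`, a hypothetical `filename` parameter, and a default folder of `"movie_reviews"`; none of these are confirmed by the diff, which only shows the `os.makedirs` call.

```python
import csv
import os


def save_reviews_csv(rows, reviews_folder="movie_reviews", filename="reviews.csv"):
    """Write (date, review) rows to <reviews_folder>/<filename>.

    Hypothetical helper for illustration only; not the loader's actual code.
    """
    # exist_ok=True keeps repeated runs from failing when the folder exists.
    os.makedirs(reviews_folder, exist_ok=True)
    path = os.path.join(reviews_folder, filename)
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["date", "review"])  # header row (assumed columns)
        writer.writerows(rows)
    return path
```

Keeping the old folder name as the default would leave existing callers unaffected, while new callers can pass their own `reviews_folder`.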
Collaborator

do we need to make this change in this PR? if there are imdb changes, let's make them in a separate PR

Contributor Author

I contributed imdb_scraper some time back, and this was just a minor change that does not affect much. To make it a separate PR, do I have to undo the changes and commit again, or is there another way? Sorry if this is a stupid question.

#### Llama Index
```python
from llama_index import download_loader
from llama_index import GPTVectorStoreIndex, download_loader
```
Collaborator

GPTVectorStoreIndex is outdated, it's now just VectorStoreIndex

Contributor Author

Yes, I have made this change, thanks.

@jerryjliu jerryjliu merged commit ccff594 into run-llama:main Nov 28, 2023