Sunday, October 29, 2023
Yahoo Finance Futures Contracts Historical Data
Saturday, October 7, 2023
Python: Regularities at beginning/end of month
Wednesday, September 6, 2023
First Trading Day Of September - S&P 500
Saturday, August 12, 2023
Sunday, June 18, 2023
Download Data in Parquet Format
Here you can see how to download a file from Yahoo Finance and save it both in csv and Parquet format. Note that Parquet efficiently compress data to about 60% of its original size. #Python #pythonprogramming
This is also a very good article about How To Efficiently Write Data To Parquet Format.
4 Ways to Write Data To Parquet With Python: A Comparison
Lies and Statistics
“There are three types of lies: lies, damn lies, and statistics…” –Benjamin Disraeli (1804–1881), Prime Minister of Great Britain (1874–1880) #quoteoftheday #quotes #InvestingQuotes @QuantScraper
Thursday, June 15, 2023
Python: Parquet - optimized for big data processing
In Python, Parquet is a columnar storage file format that is designed for efficient data storage and processing. It is optimized for use with big data processing frameworks, such as Apache Hadoop and Apache Spark, but can also be used in standalone Python applications.
The Parquet format offers several advantages over traditional row-based file formats, such as CSV or JSON, especially when working with large datasets:
Columnar storage: Parquet stores data column-wise rather than row-wise. This columnar organization allows for more efficient compression and encoding, as similar data values are stored together, reducing storage space and improving query performance.
Compression: Parquet supports various compression algorithms, such as Snappy, Gzip, and LZO. Compression helps to reduce the size of the data files, resulting in faster I/O operations and lower storage requirements.
Predicate pushdown: Parquet supports predicate pushdown, which means that when executing queries, it can skip reading entire columns or row groups based on the query predicates. This capability improves query performance by minimizing disk I/O.
Schema evolution: Parquet files can handle schema evolution, allowing for flexibility in adding, modifying, or deleting columns from the dataset without the need to rewrite the entire dataset.
To work with Parquet files in Python, you can use libraries like pyarrow
or pandas
that provide convenient APIs for reading and writing Parquet data. These libraries offer methods for converting data between Parquet files and other data structures like DataFrames, enabling seamless integration with existing Python data processing workflows.
Yahoo Finance Futures Contracts Historical Data
Futures data downloaded from yahoo finance are not adjusted as continuous contracts. When you download futures data from Yahoo Finance or ma...
-
I am thrilled with my TradeStation desktop setup! 💻 📈 Advanced charting tools, lightning-fast order execution, and a personalized workspa...
-
Frustration sets in when overfitting provides results promising on historical data, but that fail in real world. The right balance between ...
-
Here you can see how to download a file from Yahoo Finance and save it both in csv and Parquet format. Note that Parquet efficiently compre...