Free dataset download
Kaggle hosts a variety of interesting, externally contributed data sets. It runs both live and historical competitions. You can download the data for either kind, but you have to sign up for Kaggle and accept the terms of service for the competition first.
Each competition has its own associated data set. There are also user-contributed data sets in the newer Kaggle Datasets offering. Because these data sets are user-contributed, their documentation and cleanliness vary, but the vast majority are clean and ready for machine learning. The UCI Machine Learning Repository is a great first stop when looking for interesting data sets. Quandl is a repository of economic and financial data. Some of this information is free, but many data sets require purchase.
Quandl is useful for building models to predict economic indicators or stock prices. Sometimes it can be very satisfying to take a data set spread across multiple files, clean the files up, condense them into one, and then do some analysis. In data cleaning projects, it sometimes takes hours of research to figure out what each column in the data set means. These kinds of data sets are typically found on data set aggregators.
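The multi-file cleanup described above can be sketched in a few lines of Python. This is only a minimal illustration using the standard library: the function name `combine_csv_files`, the `source_file` column, and the sample data are all assumptions made for the example, not part of any particular data set or site.

```python
import csv
import io


def combine_csv_files(files):
    """Merge several CSV sources into one list of row dictionaries.

    `files` is an iterable of (name, file-like object) pairs.
    Header names are stripped and lowercased so that "Year " and
    "year" end up in the same column.
    """
    combined = []
    for name, handle in files:
        reader = csv.DictReader(handle)
        for row in reader:
            clean = {
                key.strip().lower(): (value or "").strip()
                for key, value in row.items()
                if key is not None  # skip overflow cells beyond the header
            }
            # Hypothetical extra column: remember which file a row came from.
            clean["source_file"] = name
            combined.append(clean)
    return combined


# Two in-memory "files" with inconsistent headers and stray spaces.
part_a = io.StringIO("Year ,Budget\n2014, 100\n")
part_b = io.StringIO("year,budget\n2015,120\n")

rows = combine_csv_files([("a.csv", part_a), ("b.csv", part_b)])
for row in rows:
    print(row)
```

With real files you would pass open file handles instead of `io.StringIO` objects. Keeping the source file name on each row makes it easier to trace a strange value back to where it came from.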
These aggregators tend to collect data sets from multiple sources without much curation. For cleaning practice that is an advantage, because heavily curated data sets are too neat to need extensive cleaning. In addition, you can upload your data to data. One key differentiator of data. Data can range from government budgets to school performance scores.
Anyone can download the data, although some data sets require extra steps, such as accepting a license agreement. You can browse the data sets on Data. You can browse by topic area, or search for a specific data set. The World Bank is a global development organization that offers loans and advice to developing countries.
The World Bank regularly funds programs in developing countries, then gathers data to monitor the success of these programs. You can browse World Bank data sets directly, without registering. The data sets have many missing values, and it sometimes takes several clicks to reach the actual data.
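Because missing values are common in this kind of data, it can help to measure how much of each column is empty before doing any analysis. The sketch below uses only the Python standard library. The function name `missing_share_by_column`, the set of missing-value markers, and the sample rows are assumptions for illustration, so adjust the markers to whatever the file you downloaded actually uses.

```python
import csv
import io

# Assumed markers for "no value"; check your own file and adjust.
MISSING_MARKERS = {"", "..", "NA", "N/A"}


def missing_share_by_column(handle, missing=MISSING_MARKERS):
    """Return the fraction of missing cells for each column of a CSV."""
    reader = csv.DictReader(handle)
    counts = {name: 0 for name in (reader.fieldnames or [])}
    total = 0
    for row in reader:
        total += 1
        for name in counts:
            value = row.get(name)
            # Short rows yield None; treat that as missing too.
            if value is None or value.strip() in missing:
                counts[name] += 1
    if total == 0:
        return {name: 0.0 for name in counts}
    return {name: counts[name] / total for name in counts}


# Made-up sample data, not taken from any real data set.
sample = io.StringIO(
    "country,year,literacy_rate\n"
    "A,2010,..\n"
    "B,2010,91.2\n"
    "C,2010,\n"
    "D,2010,88.0\n"
)

shares = missing_share_by_column(sample)
for column, share in shares.items():
    print(f"{column}: {share:.0%} missing")
```

A column that is mostly empty may be better dropped than filled in, so a per-column summary like this is a reasonable first check.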
Reddit, a popular community discussion site, has a section devoted to sharing interesting data sets. You can browse the subreddit, or sort it to see the most highly upvoted data sets.
Spanish news articles from the top 10, based on the ranking provided by Alexa news sites.
Russian news articles from the top 10, based on the ranking provided by Alexa news sites.
French news articles from the top 10, based on the ranking provided by Alexa news sites.
English news articles originating in the US from the top 1, based on the ranking provided by Alexa news sites.
Chinese news articles from the top 10, based on the ranking provided by Alexa news sites.
Arabic news articles from the top 10, based on the ranking provided by Alexa news sites.
Free Datasets Webz.
Popular Blog posts (Feb): English blog posts that reached a minimum number of Facebook likes within 3 days of the original post.
Popular News articles (Feb - Mar): English news articles that reached a minimum number of Facebook likes within 3 days of the original post.
Negative company reviews (Mar): Reviews about companies with a rating of 2 stars or lower.
Positive company reviews (Dec - Mar): Reviews about companies with a rating of 4 stars or higher.
Negative hotel reviews: Reviews about hotels with a rating of 2 stars or lower.
Positive hotel reviews: Reviews about hotels with a rating of 4 stars or higher.
Negative movie reviews: Movie reviews with a rating of 2 stars or lower.
With relevant data, scientists, leaders, and policymakers are able to see trends, make policy recommendations, and share critical findings.
Browse the vast quantity of climate- and environment-related data dashboards through the links below.
State, local, and federal governments rely on data to guide key decisions and formulate effective policy for their constituents. The data they generate is often in the form of open data sets that are accessible for citizens and groups to download for their own analyses. Browse the list below for a variety of examples.
Education dashboards provide educators and others a way to visualize critical metrics that affect student success and the fundamentals of education itself. These dashboards can help inform decision-making at a local, state, and national level. Browse through more education public data sets below.