Dataset public
[search 0]
×
Best Dataset podcasts we could find (updated June 2020)
Best Dataset podcasts we could find
Updated June 2020
Join millions of Player FM users today to get news and insights whenever you like, even when you're offline. Podcast smarter with the free podcast app that refuses to compromise. Let's play!
Join the world's best podcast app to manage your favorite shows online and play them offline on our Android and iOS apps. It's free and easy!
More
show episodes
 
Making artificial intelligence practical, productive, and accessible to everyone. Practical AI is a show in which technology professionals, business people, students, enthusiasts, and expert guests engage in lively discussions about Artificial Intelligence and related topics (Machine Learning, Deep Learning, Neural Networks, etc). The focus is on productive implementations and real-world scenarios that are accessible to everyone. If you want to keep up with the latest advances in AI, while k ...
 
Deep Learning (DL) has attracted much interest in a wide range of applications such as image recognition, speech recognition and artificial intelligence, both from academia and industry. This lecture introduces the core elements of neural networks and deep learning, it comprises: (multilayer) perceptron, backpropagation, fully connected neural networks loss functions and optimization strategies convolutional neural networks (CNNs) activation functions regularization strategies common practic ...
 
Deep Learning (DL) has attracted much interest in a wide range of applications such as image recognition, speech recognition and artificial intelligence, both from academia and industry. This lecture introduces the core elements of neural networks and deep learning, it comprises: (multilayer) perceptron, backpropagation, fully connected neural networks loss functions and optimization strategies convolutional neural networks (CNNs) activation functions regularization strategies common practic ...
 
Data profiling is the set of activities and processes to determine the metadata about a given dataset. Profiling data is an important and frequent activity of any IT professional and researcher.It encompasses a vast array of methods to examine data sets and produce metadata. Among the simpler results are statistics, such as the number of null values and distinct values in a column, its data type, or the most frequent patterns of its data values. Metadata that are more difficult to compute us ...
 
The Royal Statistical Society (RSS) is one of the world's most distinguished and renowned statistical societies. It is a learned society for statistics, a professional body for statisticians and a charity which promotes statistics, data and evidence for the public good. It was founded in 1834 as the Statistical Society of London and became the Royal Statistical Society by Royal Charter in 1887. Today the Society has more than 10,000 members around the world, of whom many are professionally q ...
 
De Nog Even Over... podcast heeft geen vast onderwerp of thema, maar gaat over alles dat kan volgen op "nog even over..." dat op een manier mijn interesse heeft gewekt. Ik, Bas Keetelaar, ben 23 jaar en afgestudeerd als interaction designer en heb passie voor technologie, design, games en gadgets.
 
In the Higher Ed Happy Hour, three well-known Washington, DC-based journalists and policy wonks -- Kevin Carey of New America, Andrew Kelly of the American Enterprise Institute, and Libby Nelson of Vox.com -- discuss the latest happenings in higher education policy, research, and popular culture. There are special guests, wonky digressions, and excursions into the shocking and absurd.
 
Loading …
show series
 
Support Overthinking It by becoming a member for $5/month! Matthew Belinkie, Peter Fenzel, Mark Lee, and Matthew Wrather test their knowledge of pop music decade-by-decade, and ovethink what accounts for the gaps. Download (MP3) Subscribe: iTunes Other Apps Further Reading “Identifying Generational Gaps in Music” from The Pudding ’79–’83 from The H…
 
In this episode we discuss how open data on air pollution can lead to actions towards cleaner air. What does ‘open’ data mean and what are the challenges of making air pollution data open and accessible? What is ‘air inequality’ in the broader context of environmental justice? How can people use open air quality data for good? What are the challeng…
 
Server infrastructure traditionally consists of monolithic servers containing all of the necessary hardware to run a computer. These different hardware components are located next to each other, and do not need to communicate over a network boundary to connect the CPU and memory. LegoOS is a model for disaggregated, network-attached hardware. LegoO…
 
Support Overthinking It by becoming a member for $5/month! Peter Fenzel, Mark Lee, and Matthew Wrather read Martin Luther King Jr.’s “Letter from a Birmingham Jail.” Download (MP3) Subscribe: iTunes Other Apps Further Reading “Letter from a Birmingham Jail” (full text) Episode 622: Write Long Letters, Think Long Thoughts, and Pray Long Prayers orig…
 
Kubernetes has become a highly usable platform for deploying and managing distributed systems. The user experience for Kubernetes is great, but is still not as simple as a full-on serverless implementation–at least, that has been a long-held assumption. Why would you manage your own infrastructure, even if it is Kubernetes? Why not use autoscaling …
 
Every software company is a distributed system, and distributed systems fail in unexpected ways. This ever-present tendency for systems to fail has led to the rise of failure testing, otherwise known as chaos engineering. Chaos engineering involves the deliberate failure of subsystems within an overall system to ensure that the system itself can be…
 
Brex is a credit card company that provides credit to startups, mostly companies which have raised money. Brex processes millions of transactions, and uses the data from those transactions to assess creditworthiness, prevent fraud, and surface insights for the users of their cards. Brex is full of interesting engineering problems. The high volume o…
 
On the heels of NVIDIA’s latest announcements, Daniel and Chris explore how the new NVIDIA Ampere architecture evolves the high-performance computing (HPC) landscape for artificial intelligence. After investigating the new specifications of the NVIDIA A100 Tensor Core GPU, Chris and Daniel turn their attention to the data center with the NVIDIA DGX…
 
Devices on the edge are becoming more useful with improvements in the machine learning ecosystem. TensorFlow Lite allows machine learning models to run on microcontrollers and other devices with only kilobytes of memory. Microcontrollers are very low-cost, tiny computational devices. They are cheap, and they are everywhere. The low-energy embedded …
 
Devices on the edge are becoming more useful with improvements in the machine learning ecosystem. TensorFlow Lite allows machine learning models to run on microcontrollers and other devices with only kilobytes of memory. Microcontrollers are very low-cost, tiny computational devices. They are cheap, and they are everywhere. The low-energy embedded …
 
Support Overthinking It by becoming a member for $5/month! Peter Fenzel, Mark Lee, and Matthew Wrather discuss the latest installment of The Trip, directed by Michael Winterbottom and starring Steve Coogan and Rob Brydon. This installment takes the frenemies to Greece, retracing the steps (oar-strokes?) of Odysseus, and addresses history, mortality…
 
Over the last 5 years, web development has matured considerably. React has become a standard for frontend component development. GraphQL has seen massive growth in adoption as a data fetching middleware layer. The hosting platforms have expanded beyond AWS and Heroku, to newer environments like Netlify and Vercel. These changes are collectively kno…
 
Geospatial analytics tools are used to render visualizations for a vast array of applications. Data sources such as satellites and cellular data can gather location data, and that data can be superimposed over a map. A map-based visualization can allow the end user to make decisions based on what they see. ArcGIS is one of the most widely used geos…
 
Customer data infrastructure is a type of tool for saving analytics and information about your customers. The company that is best known in this category is Segment, a very popular API company. This customer data is used for making all kinds of decisions around product roadmap, pricing, and design. RudderStack is a company built around open source …
 
Matterport is a company that builds 3-D imaging for the inside of buildings, construction sites, and other locations that require a “digital twin.” Generating digital images of the insides of buildings has a broad spectrum of applications, and there are considerable engineering challenges in building such a system. Matterport’s hardware stack invol…
 
There are many bad recipe web sites. Every time I navigate to a recipe website, it feels like my browser is filling up with spyware. The page loads slowly, everything seems broken, I can feel the 25 different JavaScript adtech tags interrupting each other. Whether I am searching for banana bread or a spaghetti sauce recipe, recipe sites usually mak…
 
Support Overthinking It by becoming a member for $5/month! Peter Fenzel, Mark Lee, and Matthew Wrather pay tribute to the great Fred Willard (c1930s—2020) by overthinking and appreciating him in Best in Show, the Christopher Guest mockumentary about dogs and their people. Download (MP3) Subscribe: iTunes Other Apps Episode 620: Why Don’t You Put th…
 
In this episode we discuss air pollution in Ghana and its intersection with public health, sources, and measurements using low-cost sensors. How does air pollution intersect with other health concerns, such as that of nutrition in Ghana? What are the sources of air pollution in a country like Ghana? What are the differences between household (indoo…
 
Amazon’s virtual server instances have come a long way since the early days of EC2. There are now a wide variety of available configuration options for spinning up an EC2 instance, which can be chosen from based on the workload that will be scheduled onto a virtual machine. There are also Fargate containers and AWS Lambda functions, creating even m…
 
A credit score is a rating that allows someone to qualify for a line of credit, which could be a loan such as a mortgage, or a credit card. We are assigned a credit score based on a credit history, which could be related to work history, rental payments, or loan repayments. One problem with the credit scoring system is that it is not internationali…
 
A large software company such as Dropbox is at a constant risk of security breaches. These security breaches can take the form of social engineering attacks, network breaches, and other malicious adversarial behavior. This behavior can be surfaced by analyzing collections of log data. Log-based threat response is not a new technique. But how should…
 
Infrastructure-as-code tools are used to define the architecture of software systems. Common infrastructure-as-code tools include Terraform and AWS CloudFormation. When infrastructure is defined as code, we can use static analysis tools to analyze that code for configuration mistakes, just as we could analyze a programming language with traditional…
 
Chandler McCann tells Daniel and Chris about how DataRobot engaged in a project to develop sustainable water solutions with the Global Water Challenge (GWC). They analyzed over 500,000 data points to predict future water point breaks. This enabled African governments to make data-driven decisions related to budgeting, preventative maintenance, and …
 
Social distancing has been imposed across the United States. We are running an experiment unlike anything before it in history, and it is likely to have a lasting impact on human behavior. By looking at location data of how people are moving around today, we can examine the real-world impacts of social distancing. SafeGraph is a company that provid…
 
Dropbox is a consumer storage product with petabytes of data. Dropbox was originally started on the cloud, backed by S3. Once there was a high enough volume of data, Dropbox created its own data centers, designing hardware for the express purpose of storing user files. Over the last 13 years, Dropbox’s infrastructure has developed hardware, softwar…
 
“Data stream” is a word that can be used in multiple ways. A stream can refer to data in motion or data at rest. When a stream is data in motion, an endpoint is receiving new pieces of data on a continual basis. Each new data point is sent over the wire and captured by the other end. Another way a stream can be represented is as a sequence of event…
 
Redis is an in-memory object storage system that is commonly used as a cache for web applications. This core primitive of in-memory object storage has created a larger ecosystem encompassing a broad set of tools. Redis is also used for creating objects such as queues, streams, and probabilistic data structures. Machine learning systems also need ac…
 
In episode 28, we interviewed Leah Silen from the NumFocus organization. She introduced us to the goals and the mission of the organization. We then had a discussion about the different levels of support provided by the organization to its member projects. She informed us about the legal, financial, technological and logistical support that can be …
 
For many applications, a transactional MySQL database is the source of truth. To make a MySQL database scale, some developers deploy their database using Vitess, a sharding system built on top of Kubernetes. Jiten Vaidya and Anthony Yeh work at PlanetScale, a company that focuses on building and supporting MySQL databases sharded with Vitess. Their…
 
Daniel and Chris get you Fully-Connected with AI questions from listeners and online forums: What do you think is the next big thing? What are CNNs? How does one start developing an AI-enabled business solution? What tools do you use every day? What will AI replace? And more… Discuss on Changelog News Sponsors DigitalOcean – DigitalOcean’s develope…
 
We are all living in social isolation due to the quarantine from COVID-19. Isolation is changing our habits and our moods, ravaging the economy, and changing how we work. One positive change is that more people have been reconnecting with their friends and family over frequent calls and video chats. Isolation is not a normal way for humans to live.…
 
A data warehouse is a system for performing fast queries on large amounts of data. A data lake is a system for storing high volumes of data in a format that is slow to access. A typical workflow for a data engineer is to pull data sets from this slow data lake storage into the data warehouse for faster querying. Apache Spark is a system for fast pr…
 
A content management system (CMS) defines how the content on a website is arranged and presented. The most widely used CMS is WordPress, the open source tool that is written in PHP. A large percentage of the web consists of WordPress sites, and WordPress has a huge ecosystem of plugins and templates. Despite the success of WordPress, the JAMStack r…
 
A data workflow scheduler is a tool used for connecting multiple systems together in order to build pipelines for processing data. A data pipeline might include a Hadoop task for ETL, a Spark task for stream processing, and a TensorFlow task to train a machine learning model. The workflow scheduler manages the tasks in that data pipeline and the lo…
 
A relational database often holds critical operational data for a company, including user names and financial information. Since this data is so important, a relational database must be architected to avoid data loss. Relational databases need to be a distributed system in order to provide the fault tolerance necessary for production use cases. If …
 
Daniel and Chris have a fascinating discussion with Anna Goldie and Azalia Mirhoseini from Google Brain about the use of reinforcement learning for chip floor planning - or placement - in which many new designs are generated, and then evaluated, to find an optimal component layout. Anna and Azalia also describe the use of graph convolutional neural…
 
Despite its application to myriad humanitarian and civil use cases, automated road network extraction from overhead satellite imagery remains quite challenging. However, the SpaceNet 5 challenge made significant progress in this field with top participants being able to extract both road networks and speed/travel time estimates for each roadway. On…
 
Python is the most widely used language for data science, and there are several libraries that are commonly used by Python data scientists including Numpy, Pandas, and scikit-learn. These libraries improve the user experience of a Python data scientist by giving them access to high level APIs. Data science is often performed over huge datasets, and…
 
Support Overthinking It by becoming a member for $5/month! Ben Adams, Mark Lee, and Matthew Wrather, spurred by (a lack of?) news out of North Korea and autocracies everywhere, overthink The Death of Stalin, Armando Iannucci’s comedy about the days before and after the dictator’s death. Download (MP3) Subscribe: iTunes Other Apps Further Reading DP…
 
Chatbots became widely popular around 2016 with the growth of chat platforms like Slack and voice interfaces such as Amazon Alexa. As chatbots came into use, so did the infrastructure that enabled chatbots. NLP APIs and complete chatbot frameworks came out to make it easier for people to build chatbots. The first suite of chatbot frameworks were la…
 
Chatbots became widely popular around 2016 with the growth of chat platforms like Slack and voice interfaces such as Amazon Alexa. As chatbots came into use, so did the infrastructure that enabled chatbots. NLP APIs and complete chatbot frameworks came out to make it easier for people to build chatbots. The first suite of chatbot frameworks were la…
 
Serverless computing is a way of designing applications that do not directly address or deploy application code to servers. Serverless applications are composed of stateless functions-as-a-service and stateful data storage systems such as Redis or DynamoDB. Serverless applications allow for scaling up and down the entire architecture, because each …
 
Loading …
Google login Twitter login Classic login