Home
Categories
EXPLORE
True Crime
Comedy
Business
Society & Culture
History
Sports
Health & Fitness
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts221/v4/d5/5c/87/d55c8700-ceaf-f9f2-47f9-b77841560143/mza_17377174697774825118.jpg/600x600bb.jpg
Data Science Tech Brief By HackerNoon
HackerNoon
127 episodes
1 month ago
Learn the latest data science updates in the tech world.
Show more...
Tech News
News
RSS
All content for Data Science Tech Brief By HackerNoon is the property of HackerNoon and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Learn the latest data science updates in the tech world.
Show more...
Tech News
News
Episodes (20/127)
Data Science Tech Brief By HackerNoon
Applying Transitive Closure to Sort Products Into Categories, Considering Nesting and Overlaps

This story was originally published on HackerNoon at: https://hackernoon.com/applying-transitive-closure-to-sort-products-into-categories-considering-nesting-and-overlaps.
A guide to efficiently managing nested categories and overlapping products, ensuring fast retrieval without duplicates in e-commerce systems.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #data-management, #software-architecture, #product-categorization, #graph-theory, #microservices, #optimize-data-storage, #transitive-closure, #advanced-indexing, and more.

This story was written by: @egorgrushin. Learn more about this writer by checking @egorgrushin's about page, and for more stories, please visit hackernoon.com.

Handling product categorization in e-commerce can be quite the task, especially when nested categories and overlapping products make efficient retrieval without duplicates a real challenge. The method I found has a major impact on performance: setting up proper data storage, separating data for reading and modification, using relational and NoSQL databases, and applying graph theory to handle complex category nesting. The step-by-step guide shows how to sort out efficient data storage, use transitive closure for advanced indexing, build a service to maintain and update the graph, and take advantage of database indexing to avoid unnecessary sorting in RAM.

Show more...
1 month ago
15 minutes

Data Science Tech Brief By HackerNoon
98% of Data Strategies Fail: Let's Fix It

This story was originally published on HackerNoon at: https://hackernoon.com/98percent-of-data-strategies-fail-lets-fix-it.
Learn how to fix failing data strategies using the '5 W's' framework. Transform your approach to KPIs and drive real business value with actionable insights.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #data-strategy, #kpi-management, #business-intelligence, #data-driven-decisions, #executive-leadership, #analytics-roi, #data-roi, #data-governance, and more.

This story was written by: @liorb. Learn more about this writer by checking @liorb's about page, and for more stories, please visit hackernoon.com.

Even the most well-equipped organizations can find themselves serving up a mess instead of actionable insights. Here's a step-by-step process of fixing your data strategy, ensuring that you're serving up actionable data instead of a recipe for disaster. In the following sections, we'll dive into the common data strategy nightmares.

Show more...
1 year ago
11 minutes

Data Science Tech Brief By HackerNoon
How To Measure The Results Of In-App Events When Onelinks Don’t Work

This story was originally published on HackerNoon at: https://hackernoon.com/how-to-measure-the-results-of-in-app-events-when-onelinks-dont-work.
How To Measure The Results Of In-App Events When Onelinks Don’t Work
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #analytics, #onelink, #inapp-events, #marketing, #app-store, #mobile-apps, #digital-marketing, #good-company, and more.

This story was written by: @socialdiscoverygroup. Learn more about this writer by checking @socialdiscoverygroup's about page, and for more stories, please visit hackernoon.com.

Many app developers and marketing managers face the challenge of accurately measuring the impact of In-App Events (IAEs) on the App Store. While IAEs have proven effective for re-engaging users, attracting new downloads, and increasing revenue, traditional tracking methods like OneLink don’t actually include IAEs. Major mobile attribution platforms confirm that currently there is no way to track IAEs properly. At Social Discovery Group, our portfolio of 60+ dating and entertainment brands is supported by a team of over 100 marketers dedicated to app growth and development. We’re used to measuring all our marketing efforts in terms of financial value. Eventually, we’ve managed to develop our own composite way to evaluate IAEs, and are going to share it with you.

Show more...
1 year ago
5 minutes

Data Science Tech Brief By HackerNoon
How AI-Powered Data Mapping is Democratizing Data Management

This story was originally published on HackerNoon at: https://hackernoon.com/how-ai-powered-data-mapping-is-democratizing-data-management.
Learn how AI-powered data mapping is transforming data management, making it more accessible and efficient for everyone.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #data-mapping, #data-management, #big-data, #ai-powered, #ai-powered-data-management, #democratizing-data-management, #data-science, #ai-powered-data-mapping, and more.

This story was written by: @kristenburke. Learn more about this writer by checking @kristenburke's about page, and for more stories, please visit hackernoon.com.

AI is revolutionizing data mapping by automating and simplifying the process, making data management more efficient and accessible for businesses and non-technical users alike.

Show more...
1 year ago
8 minutes

Data Science Tech Brief By HackerNoon
Data Engineering: What’s the Value of API Security in the Generative AI Era?

This story was originally published on HackerNoon at: https://hackernoon.com/data-engineering-whats-the-value-of-api-security-in-the-generative-ai-era.
Discover the importance of API security in the age of Generative AI. Learn how robust API protection ensures data integrity.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #data-engineering, #generative-ai, #ai-regulation, #api-security, #data-security, #data-privacy, #threat-detection, #cybersecurity-best-practices, and more.

This story was written by: @karthikrajashekaran. Learn more about this writer by checking @karthikrajashekaran's about page, and for more stories, please visit hackernoon.com.

API security is crucial in the era of Generative AI, ensuring data integrity, protecting user privacy, and enabling secure and efficient AI integration. Robust API protection helps prevent unauthorized access, data breaches, and potential misuse of AI capabilities.

Show more...
1 year ago
5 minutes

Data Science Tech Brief By HackerNoon
Say Goodbye to Outdated Diagrams: Automate Your Infrastructure Visualization

This story was originally published on HackerNoon at: https://hackernoon.com/say-goodbye-to-outdated-diagrams-automate-your-infrastructure-visualization.
Automate your infrastructure diagrams. Guide helps you maintain fresh, accurate visuals with minimal effort, perfect for managing
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #visualization, #cloud-infrastructure, #terraform, #diagram, #infrastructure-as-code, #cloud, #aws, #infrastructure-visualization, and more.

This story was written by: @vladimirf. Learn more about this writer by checking @vladimirf's about page, and for more stories, please visit hackernoon.com.

Tired of making awesome infrastructure diagrams that become outdated as soon as you save them? Yeah, me too. Luckily, there are tools out there to help.

Show more...
1 year ago
7 minutes

Data Science Tech Brief By HackerNoon
Why C-Suite Executives Won’t Cut it Without Data Skills Anymore

This story was originally published on HackerNoon at: https://hackernoon.com/why-c-suite-executives-wont-cut-it-without-data-skills-anymore.
Modern executives must master data skills to navigate data privacy, cybersecurity, and strategic decisions. Learn why C-suite leaders can't afford to lag behind
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #data-literacy, #thought-leadership, #leadership-skills, #data-skills, #data-governance, #data-visualization-tools, #cybersecurity-executives, #hackernoon-top-story, and more.

This story was written by: @znenad079. Learn more about this writer by checking @znenad079's about page, and for more stories, please visit hackernoon.com.

Every industry generates massive amounts of data, which is now being used for better decision-making. One of today’s most pressing challenges for executives is data privacy concerns and cybersecurity. Modern executives must have data skills to understand the flow of valuable data in their company and know how to make it work for them.

Show more...
1 year ago
6 minutes

Data Science Tech Brief By HackerNoon
Meet New & Improved BigQuery: Single, Unified AI-Ready Data Platform

This story was originally published on HackerNoon at: https://hackernoon.com/meet-new-and-improved-bigquery-single-unified-ai-ready-data-platform.
Google has gone a step further and unified key data Google Cloud analytics capabilities under BigQuery - now the single, AI-ready data analytics platform. 
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #data-analytics, #google-bigquery, #bigquery-and-google-cloud, #ai-integration, #big-query-and-gemini, #good-company, #hackernoon-top-story, #real-time-data-analytics, and more.

This story was written by: @googlecloud. Learn more about this writer by checking @googlecloud's about page, and for more stories, please visit hackernoon.com.

We’ve gone a step further and unified key data Google Cloud analytics capabilities under BigQuery, which is now the single, AI-ready data analytics platform. BigQuery incorporates key capabilities from multiple Google Cloud analytics services into a single product experience that offers the simplicity and scale you need to manage structured data in BigQuery tables, unstructured data like images, audience and documents, and streaming workloads, all with the best price-performance. 

Show more...
1 year ago
10 minutes

Data Science Tech Brief By HackerNoon
Decoding Transformers' Superiority over RNNs in NLP Tasks

This story was originally published on HackerNoon at: https://hackernoon.com/decoding-transformers-superiority-over-rnns-in-nlp-tasks.
Explore the intriguing journey from Recurrent Neural Networks (RNNs) to Transformers in the world of Natural Language Processing in our latest piece: 'The Trans
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #nlp, #transformers, #llms, #natural-language-processing, #large-language-models, #rnn, #machine-learning, #neural-networks, and more.

This story was written by: @artemborin. Learn more about this writer by checking @artemborin's about page, and for more stories, please visit hackernoon.com.

Despite Recurrent Neural Networks (RNNs) designed to mirror certain aspects of human cognition, they've been surpassed by Transformers in Natural Language Processing tasks. The primary reasons include RNNs' issues with the vanishing gradient problem, difficulty in capturing long-range dependencies, and training inefficiencies. The hypothesis that larger RNNs could mitigate these issues falls short in practice due to computational inefficiencies and memory constraints. On the other hand, Transformers leverage their parallel processing ability and self-attention mechanism to efficiently handle sequences and train larger models. Thus, the evolution of AI architectures is driven not only by biological plausibility but also by practical considerations such as computational efficiency and scalability.

Show more...
1 year ago
9 minutes

Data Science Tech Brief By HackerNoon
How to Enable Auto-Start for Apache DolphinScheduler

This story was originally published on HackerNoon at: https://hackernoon.com/how-to-enable-auto-start-for-apache-dolphinscheduler.
To set DolphinScheduler to start automatically upon system boot, you typically need to configure it as a system service.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #bigdata, #data-science, #workflow-automation, #linux, #how-to-enable-auto-start, #apache-dolphinscheduler, #apache-dolphinscheduler-guide, and more.

This story was written by: @williamguo. Learn more about this writer by checking @williamguo's about page, and for more stories, please visit hackernoon.com.

To set DolphinScheduler to start automatically upon system boot, you typically need to configure it as a system service. The following are general steps, which may vary depending on your operating system.

Show more...
1 year ago
4 minutes

Data Science Tech Brief By HackerNoon
Benchmarking Apache Kafka: Performance-per-price

This story was originally published on HackerNoon at: https://hackernoon.com/benchmarking-apache-kafka-performance-per-price.
This is a study comparing environments for Apache Kafka. The ultimate goal is to find the most effective setup and achieve the best price-performance ratio.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #apache-kafka, #amd, #arm, #aws, #gcp, #kafka-performance, #benchmarking-apache-kafka, #hackernoon-top-story, and more.

This story was written by: @mishaepikhin. Learn more about this writer by checking @mishaepikhin's about page, and for more stories, please visit hackernoon.com.

ARM rocks. Modern expensive architecture does not always mean “better”.

Show more...
1 year ago
13 minutes

Data Science Tech Brief By HackerNoon
When and When Not to Use Apache Kafka as a Database

This story was originally published on HackerNoon at: https://hackernoon.com/when-and-when-not-to-use-apache-kafka-as-a-database.
Discover how Apache Kafka’s data retention and querying capabilities make it similar to a database and learn when to use Kafka for database-like use cases.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #apache-kafka, #kafka-vs-database, #kafka-as-a-database, #real-time-data-processing, #database-management, #kafka-querying-capabilities, #open-source-event-streaming, #apache-kafka-for-data-storage, and more.

This story was written by: @aahil. Learn more about this writer by checking @aahil's about page, and for more stories, please visit hackernoon.com.

Apache Kafka, while not a traditional database, has database-like properties such as data retention and querying capabilities. This article explores when Kafka can be used for database-like purposes and when it is best suited as a streaming platform.

Show more...
1 year ago
9 minutes

Data Science Tech Brief By HackerNoon
A Leader's Guide to Data-Driven Success

This story was originally published on HackerNoon at: https://hackernoon.com/a-leaders-guide-to-data-driven-success.
Transform data from a source of frustration into a powerful business tool with this practical guide for executives.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #data-strategy, #business-insights, #data-management, #data-literacy, #data-analytics, #business-growth, #information-overload, #business-strategy, and more.

This story was written by: @liorb. Learn more about this writer by checking @liorb's about page, and for more stories, please visit hackernoon.com.

Despite having more information than ever, making informed decisions seems increasingly challenging. This guide is designed to help you transform data from a source of frustration into a powerful tool for driving business growth. From my own experience, I've seen professionals dedicating up to 50% of their workweek to validating data.

Show more...
1 year ago
7 minutes

Data Science Tech Brief By HackerNoon
Seamlessly Migrate Your On-Premise Data Pipeline to Azure with These Key Steps

This story was originally published on HackerNoon at: https://hackernoon.com/seamlessly-migrate-your-on-premise-data-pipeline-to-azure-with-these-key-steps.
Scaling AI/ML Data Needs: Migrating On-Premise Data Engineering Workloads to Azure Cloud
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #data-engineering, #azure-data-factory, #data-pipeline-migration, #azure-migration, #azure-data-integration, #cloud-data-transfer, #cloudera-to-azure, #azure-security-compliance, and more.

This story was written by: @amlanpatnaik. Learn more about this writer by checking @amlanpatnaik's about page, and for more stories, please visit hackernoon.com.

This guide details the process of migrating an on-premise Cloudera data system to Azure, covering key considerations, challenges, and best practices to ensure a smooth and secure transition.

Show more...
1 year ago
12 minutes

Data Science Tech Brief By HackerNoon
Data Collection for Product Managers

This story was originally published on HackerNoon at: https://hackernoon.com/data-collection-for-product-managers.
Discover how product managers can bridge the gap between intuition and data to optimize product improvement with best practices and real-world examples.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #data-collection, #startups, #data-product-management, #data-driven-insights, #data-driven-decision-making, #product-manager, #product-management-tips, #how-to-collect-data, and more.

This story was written by: @carolinagarcia. Learn more about this writer by checking @carolinagarcia's about page, and for more stories, please visit hackernoon.com.

Discover how product managers can bridge the gap between intuition and data to optimize product improvement. This guide explores the importance of data-driven decision-making, offering best practices and real-world examples from companies like NuBank, Monzo, Deliveroo, and Booking.com. Learn how to acquire insights from customer feedback, track performance metrics, monitor market trends, and refine product roadmaps through iterative experimentation. Become a data-driven PM and create products that users will love.

Show more...
1 year ago
7 minutes

Data Science Tech Brief By HackerNoon
Data Collection for Product Managers

This story was originally published on HackerNoon at: https://hackernoon.com/data-collection-for-product-managers.
Discover how product managers can bridge the gap between intuition and data to optimize product improvement with best practices and real-world examples.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #data-collection, #startups, #data-product-management, #data-driven-insights, #data-driven-decision-making, #product-manager, #product-management-tips, #how-to-collect-data, and more.

This story was written by: @carolinagarcia. Learn more about this writer by checking @carolinagarcia's about page, and for more stories, please visit hackernoon.com.

Discover how product managers can bridge the gap between intuition and data to optimize product improvement. This guide explores the importance of data-driven decision-making, offering best practices and real-world examples from companies like NuBank, Monzo, Deliveroo, and Booking.com. Learn how to acquire insights from customer feedback, track performance metrics, monitor market trends, and refine product roadmaps through iterative experimentation. Become a data-driven PM and create products that users will love.

Show more...
1 year ago
7 minutes

Data Science Tech Brief By HackerNoon
Leveraging Data Granularity, Distribution, and Modeling for Effective Product Management

This story was originally published on HackerNoon at: https://hackernoon.com/leveraging-data-granularity-distribution-and-modeling-for-effective-product-management.
These three fundamental concepts are exceptionally needed for being able to use data to enhance product strategy.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #data-analysis, #data-driven-product-management, #data-granularity, #data-distribution, #product-strategy, #user-behavior-analysis, #data-modeling, #business-strategy, and more.

This story was written by: @gevorgkazaryan. Learn more about this writer by checking @gevorgkazaryan's about page, and for more stories, please visit hackernoon.com.

Granularity determines the level of detail available in the data, which directly impacts what you can observe and analyze. For instance, finer granularity provides more detailed insights but may require more sophisticated handling and processing techniques. Distribution helps identify the patterns and spread of data, which is critical for selecting the appropriate analysis techniques and ensuring the accuracy of predictive models. Data Modeling uses the insights gained from understanding granularity and distribution to build predictive or descriptive models that inform decision-making and strategy.

Show more...
1 year ago
11 minutes

Data Science Tech Brief By HackerNoon
How Vectors, Rag and Llama 3 Are Changing First-Party Data

This story was originally published on HackerNoon at: https://hackernoon.com/how-vectors-rag-and-llama-3-are-changing-first-party-data.
In the battle for the best data, is first-party better? Not by itself, but it could be with vectors, frameworks like RAG, and open-source models
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #first-party-data, #big-data, #datasets, #rag-architecture, #retrieval-augmented-generation, #vector-embedding, #ai-models-for-data-analysis, #hackernoon-top-story, and more.

This story was written by: @danielsvonava. Learn more about this writer by checking @danielsvonava's about page, and for more stories, please visit hackernoon.com.

The push for first-party data generally goes that companies need to become better stewards of data acquisition and management. Consumers increasingly want to know who is hanging onto their personal information, how they got it, why they have it, and what is being done with it. The push to take back control of data seems essential, but is it practical?

Show more...
1 year ago
7 minutes

Data Science Tech Brief By HackerNoon
16 Best Sklearn Datasets for Building Machine Learning Models

This story was originally published on HackerNoon at: https://hackernoon.com/16-best-sklearn-datasets-for-building-machine-learning-models.
Sklearn datasets are included as part of the scikit-learn (sklearn) library, so they come pre-installed with the library.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #sklearn, #datasets, #datascience, #sklearn-datasets, #machine-learning, #python-programming, #dataset, #hackernoon-top-story, and more.

This story was written by: @datasets. Learn more about this writer by checking @datasets's about page, and for more stories, please visit hackernoon.com.

Sklearn is a Python module for machine learning built on top of SciPy. It is unique due to its wide range of algorithms and ease of use. Data powers machine learning algorithms and scikit-learn. Sklearn offers high quality datasets that are widely used by researchers, practitioners and enthusiasts.

Show more...
1 year ago
21 minutes

Data Science Tech Brief By HackerNoon
Enhancing Audit Processes With Advanced Analytical Tools

This story was originally published on HackerNoon at: https://hackernoon.com/enhancing-audit-processes-with-advanced-analytical-tools.
Discover how advanced analytical tools streamline audit processes, boosting accuracy and efficiency for tech professionals.
Check more stories related to data-science at: https://hackernoon.com/c/data-science. You can also check exclusive content about #advanced-analytics, #software-development, #audit, #analytics-based-auditing, #auditing-tech, #data-visualization, #complex-event-processing, #ai-in-analytics, and more.

This story was written by: @devinpartida. Learn more about this writer by checking @devinpartida's about page, and for more stories, please visit hackernoon.com.

Developers can leverage advanced analytics tools to streamline and improve software, compliance and internal controls auditing. Advanced analytics tools like artificial intelligence, complex event processing and data mining enable 100% population testing. They eliminate the need for sampling, thereby reducing bias and error risks. Autonomous technologies like AI are particularly beneficial since they eliminate human error.

Show more...
1 year ago
5 minutes

Data Science Tech Brief By HackerNoon
Learn the latest data science updates in the tech world.