
Data Science Conversations (Damien Deighan and Philipp Diesinger)
Explore every episode of Data Science Conversations
Pub. Date | Title | Duration | |
---|---|---|---|
25 Nov 2024 | Maximising the Impact of Your Data & AI Consulting Projects | 00:46:47 | |
In our latest episode of the Data Science Conversations Podcast, we spoke with Christoph Sporleder, Managing Partner at Rewire, about the evolving role of consulting in the data and AI space. This conversation is a must listen for anyone dealing with the challenges of integrating AI into business processes or considering an AI project with an external consulting firm. Christoph draws from decades of experience, offering practical advice and actionable insights for organizations and practitioners alike. Key Topics Discussed 1. Evolution of Data and Cloud Computing The shift from local computing to cloud technologies, enabling broader data integration and advanced analytics, with the rise of IoT and machine data. 2. Data Management Challenges Discussion on the evolution from data warehouses to data lakes and the emerging concept of data mesh for better governance and scalability. 3. Importance of Strategy in AI Why a clear strategy is crucial for AI adoption, including aligning organizational leadership and identifying impactful use cases. 4. Sectoral Adoption of Data and AI Differences in adoption across sectors, with early adopters in finance and insurance versus later adoption in manufacturing and infrastructure. 5. Consulting Models and Engagement Insights into consulting engagement types, including strategy consulting, system integration, and body leasing, and their respective challenges and benefits. 6. Challenges in AI Implementation Common pitfalls in AI projects, such as misalignment with business goals, inadequate infrastructure planning, and siloed lighthouse initiatives. 7. Leadership’s Role in AI Success The critical need for senior leadership commitment to drive AI adoption, ensure process integration, and manage organizational change. 8. Effective Collaboration with Consultants Best practices for successful partnerships with consultants, including aligning on objectives, managing personnel transitions, and setting clear engagement expectations. 9. Future Trends in Data and AI Emerging trends like componentized AI architectures, Gen AI integration, and the growing focus on embedding AI within business processes. 10. Tips for Managing Long-Term Projects Strategies for handling staff rotations and maintaining project continuity in consulting engagements, emphasizing planning and communication. | |||
24 Jul 2024 | Future AI Trends: Strategy, Hardware and AI Security at Intel | 01:02:33 | |
In this episode, we sit down with Steve Orrin, Federal Chief Technology Officer at Intel Corporation. Steve shares his extensive experience and insights on the transformative power of AI and its parallels with past technological revolutions. He discusses Intel’s pioneering role in enabling these shifts through innovations in microprocessors, wireless connectivity, and more. Steve highlights the pervasive role of AI in various industries and everyday technology, emphasizing the importance of a heterogeneous computing architecture to support diverse AI environments. He talks about the challenges of operationalizing AI, ensuring real-world reliability, and the critical need for robust AI security. Confidential computing emerges as a key solution for protecting AI workloads across different platforms. The episode also explores Intel’s strategic tools like oneAPI and OpenVINO, which streamline AI development and deployment. This episode is a must-listen for anyone interested in the evolving landscape of AI and its real-world applications. Intel's Legacy and Technological Revolutions
AI's Current and Future Landscape
Intel's Approach to AI
Challenges and Solutions in AI Deployment
AI Security Concerns
Innovations in AI Hardware and Software
Collaboration and Standards in AI Security
Advice for Aspiring AI Security Professionals
Exciting Developments in AI
| |||
29 Sep 2021 | Perry Marshall - Why Evolutionary Biology Has Big Implications For Future AI Development | 00:59:53 | |
In this episode we are joined by Perry Marshall to talk about his latest scientific paper entitled “Biology Transcends the Limits of Computation”. We also discuss his $10 million Evolution 2.0 Science Prize, which is the largest prize in the world in science currently. His paper pushes the boundaries in the field of evolutionary biology and his science prize is driving some truly fascinating and thought provoking implications for the development of strong AI. | |||
29 Jan 2024 | The Path to Responsible AI with Julia Stoyanovich of NYU | 00:48:09 | |
In this enlightening episode, Dr. Julia Stoyanovich delves into the world of responsible AI, exploring the ethical, societal, and technological implications of AI systems. She underscores the importance of global regulations, human-centric decision-making, and the proactive management of biases and risks associated with AI deployment. Through her expert lens, Dr. Stoyanovich advocates for a future where AI is not only innovative but also equitable, transparent, and aligned with human values. Julia is an Institute Associate Professor at NYU in both the Tandon School of Engineering, and the Center for Data Science. In addition she is Director of the Center for Responsible AI also at NYU. Her research focuses on responsible data management, fairness, diversity, transparency, and data protection in all stages of the data science lifecycle. Episode Summary -
| |||
01 Nov 2023 | The future of LLMs, ELMs and the semantic layer | 00:34:50 | |
In this episode Tarush Aggarwal, formerly of Salesforce and WeWork is back on the podcast to discuss the evolution of the Semantic layer and how that can help practitioners get results from LLMs. We also discuss how smaller ELMs (expert language models) might be the future when it comes to consistent reliable outputs from Generative AI and also the impact of all of this on traditional BI tools. | |||
29 Aug 2024 | The Evolution of GenAI: From GANs to Multi-Agent Systems | 00:43:27 | |
Early Interest in Generative AI
Development of GANs and Early Language Models since 2016
Launch of GenerativeAI.net and Online Course
Defining Generative AI
Evolution of GenAI Technologies
Impact of Computing Power on GenAI
Generative AI in Business Applications
Retrieval Augmented Generation (RAG) Architecture
Technological Drivers of GenAI
Small vs. Large Language Models
Challenges in Implementing GenAI Systems
Measuring GenAI Performance
Emerging Trends in GenAI
| |||
21 Sep 2020 | Trailer Episode | 00:03:37 | |
Your co-Hosts Damien Deighan and Philipp Diesinger discuss the Data Science conversations Podcast concept. You will find out what to expect from the show in the coming weeks. | |||
26 Jan 2021 | How AI Imaging Is Transforming Satellite Imagery | 00:52:27 | |
In this episode we discuss the rapidly developing field of Satellite Imaging. Our guests on this show are Heidi Hurst & Jerry He. They are two remarkable industry Data Scientists with a strong academic pedigree and experience in the field of Satellite Image Processing. Heidi is based in Washington DC and Jerry is based in New York. Join us as they discuss their journey into Satellite Imaging and share with us the latest developments in this fascinating and evolving area of Data Science. Episode Summary
RESOURCES: Cool Visual - One Hour of active Satellites orbiting Earth:
DOTA - https://captain-whu.github.io/DOTA/ - Open dataset for object detection in overhead imagery
COwC - https://gdo152.llnl.gov/cowc/ - Cars Overhead with Context - specific detection dataset for car counting algorithms
xView - http://xviewdataset.org/ - dataset put together by the National Geospatial Intelligence Agency for an object detection challenge, including some particularly rare classes | |||
18 May 2022 | How Observability is Advancing Data Reliability and Data Quality | 00:43:49 | |
Modern Data Infrastructures and platforms store huge amounts of multidimensional data. But - data pipelines frequently break and a machine learning algorithm's performance is only as good as the quality and reliability of the data itself. In this episode we are joined by Lior Gavish and Ryan Kearns of Monte Carlo, to talk about how the new concept of Data Observability is advancing Data Reliability and Data Quality at Scale. Episode Summary
| |||
15 Feb 2023 | How Science is (mis)communicated in Online Media | 00:33:56 | |
Ágnes Horvát is an Assistant Professor in Communication and Computer Science at Northwestern University. Her work focuses on understanding how online networks induce biased information production, sharing and processing across digital platforms. - The new Post-normal era for science - Having an awareness of the context and values that impact scientific research
Reducing the problem of miscommunication - with whom does the responsibility lie? | |||
20 Jul 2021 | How XPRIZE is enabling AI for social good | 00:40:08 | |
In this episode we are joined by the Director of AI and Data Operations at XPRIZE whose career path into the world of AI is fascinating. Neama Dadkhahnikoo shares his journey from his early days at Boeing back in 2005, through start up ventures, Techspert and Caregivers Direct, and re-training right through to the present day at XPRIZE. He reveals how anyone has the potential to make a real difference in using AI to help solve real world problems.
| |||
17 Aug 2021 | How AI Is Driving The Eradication Of Malaria | 00:36:24 | |
In this episode we are joined by Arnon Houri Yafin, an Israeli entrepreneur who is the founder of a company called Zzapp Malaria, which recently won the AI XPRIZE sponsored by IBM Watson. Their work in using AI to eliminate malaria in Africa is both interesting and inspirational. Episode Summary
| |||
08 Dec 2023 | Transforming Freight Logistics with AI and Machine Learning | 01:01:43 | |
Luis Moreira-Matias is Senior Director of Artificial Intelligence at sennder, Europe’s leading digital freight forwarder. At sennder, Luis founded sennAI: sennder’s organization that oversees the creation (from R&D to real-world productization) of proprietary AI technology for the road logistics industry. During his 15 years of career, Luis led 50+ FTEs across 4+ organisations to develop award-winning ML solutions to address real-world problems in various fields such as e-commerce, travel, logistics, and finance. Luis holds a Ph.D. in Machine Learning from the U. Porto, Portugal. He possesses a world-class academic track with high impact publications at top tier venues in ML/AI fundamentals, 5 patents and multiple keynotes worldwide - ranging from Brisbane (Australia) to Las Palmas (Spain). | |||
16 Nov 2021 | Using Time Series Analysis to Uncover Why Gun Sales Increase After Mass Shootings - Maurizio Porfiri | 00:40:41 | |
In this episode we are joined by Professor Maurizio Porfiri from NYU, to talk about his latest academic research which is using data science to uncover why sales of guns in the USA increase after a mass shooting event. His interest and research was borne from a very personal experience 14 years ago when he experienced a mass shooting event at Virginia Tech where he was studying.
| |||
07 May 2021 | How to Leverage Data For Exponential Growth - Tarush Aggarwal | 01:07:31 | |
In this episode we are joined by an industry veteran who has worked for some of the biggest names in the enterprise Data world. Tarush Aggarwal shares his journey from his early days at Salesforce and then WeWork, right through to the present day. He reveals how to set Data Science & Engineering up for success in both small and large organisations. Episode Summary
| |||
14 Mar 2023 | Mapping forests: Verifying carbon offsetting with machine learning | 00:25:08 | |
In this episode Heidi Hurst returns to talk to us about how in her current role at Pachama she is using the power of machine learning to fight climate change. She discusses her work in measuring the capacity of existing forests and reforestation projects using satellite imagery. Episode Summary 1. The importance of carbon credits verification in mitigating climate change 2. How Pachama is using machine learning and satellite imagery to verify carbon projects 3. Three types of carbon projects: avoided deforestation, reforestation, and improved forest management 4. Challenges in using satellite imagery to measure the capacity of existing forests 5. The role of multispectral imaging in measuring density of forests 6. Challenges in collecting data from dense rainforests and weather obstructions 7. The impact of machine learning on scaling up carbon verification 8. Advancements in the field of satellite imaging, particularly in small satellite constellations | |||
01 Feb 2022 | The Pitfalls of Using AI Systems for Hiring | 00:44:43 | |
In this episode we are joined by Julia Stoyanovich from NYU, to talk about her work into how AI is being used in the hiring process. Whether you are responsible for hiring on behalf of a business or are a job seeker, you will find this podcast very interesting, but for very different reasons. Episode Summary
| |||
07 Oct 2020 | Philipp Koehn (Part 1) - How Neural Networks Have Transformed Machine Translation | 00:30:34 | |
Professor Philipp Koehn of Johns Hopkins University discusses the evolution of machine translation and the fundamentals for using Neural Networks to deliver Machine translation. Episode Summary:
Resources: Philipp Koehn latest book - Neural Machine Translation - Amazon link: https://www.amazon.com/Neural-Machine-Translation-Philipp-Koehn/dp/1108497322 | |||
10 Dec 2024 | Key Principles For Scaling AI In Enterprise: Leadership Lessons With Walid Mehanna | 01:03:57 | |
In this episode, we had the privilege of speaking with Walid Mehanna, Chief Data and AI Officer at Merck Group. Walid shares deep insights into how large, complex organizations can scale data and AI and create lasting impact through thoughtful leadership. As Chief Data & AI Officer of Merck Group, Walid led the Merck Data & AI Organization, delivering strategy, value, architecture, governance, engineering, and operations across the whole company globally. Hand in hand with Merck’s business sectors and their data offices, we harnessed the power of Data & AI. Walid is glad to be part of Merck as another curious mind dedicated to human progress. | |||
04 Mar 2024 | Using Open Source LLMs in Language for Grammatical Error Correction (GEC) | 00:50:27 | |
At LanguageTool, Bartmoss St Clair (Head of AI) is pioneering the use of Large Language Models (LLMs) for grammatical error correction (GEC), moving away from the tool's initial non-AI approach to create a system capable of catching and correcting errors across multiple languages. LanguageTool supports over 30 languages, has several million users, and over 4 million installations of its browser add-on, benefiting from a diverse team of employees from around the world. Episode Summary -
| |||
22 Oct 2020 | Deep Fakes (Part 1) - Technological Advancements & Impact on Society | 00:21:50 | |
This is Part one of our conversation about Deep Fakes with two experts in their respective fields. We talk to Dr Eileen Culloty of the Institute for Future Media and Journalism at Dublin City University and Dr Stephane Lathuiliere of Telecom Paris. Stephane reveals what is possible and what is not possible technically with current Deep Fakes Technology. Eileen helps us cut through the hype about Deep Fakes and tells us about their real world social and political impact. EPISODE SUMMARY:
RESOURCES: Video of First Order Motion Model For Video Animation: https://www.youtube.com/watch?v=u-0cQ-grXBQ&ab_channel=AliaksandrSiarohin
PROVENANCE program: https://fujomedia.eu/provenance/ | |||
06 Jun 2024 | Enhancing GenAI with Knowledge Graphs: A Deep Dive with Kirk Marple | 00:44:46 | |
In this episode we talk to Kirk Marple about the power of Knowledge Graphs when combined with GenAI models. Kirk explained the growing relevance of knowledge graphs in the AI era, the practical applications, their integration with LLMs, and the future potential of Graph RAG. Kirk Marple a veteran of Microsoft and General Motors, Kirk has spent the last 30 years in software development and data leadership roles. He also successfully exited the first startup he founded, RadiantGrid, acquired by Wohler Technologies. Now, as the technical founder and CEO of Graphlit, Kirk and his team are streamlining the development of vertical AI apps with their end-to-end, cloud based offering that ingests unstructured data and leverages retrieval augmented generation to improve accuracy, domain specificity, adaptability, and context understanding – all while expediting development. Episode Summary -
| |||
24 Sep 2024 | KP Reddy: How AI is Reshaping Startup Dynamics and VC Strategies | 01:01:53 | |
KP Reddy, founder and managing partner of Shadow Ventures, explains how AI is set to redefine the startup landscape and the venture capital model. KP shares his unique perspective on the rapidly evolving role of AI in entrepreneurship, offering insights into:
| |||
10 Nov 2020 | AI V Humans (Part 1) - Esports Legends Battle With AlphaStar (Google DeepMind) | 00:27:33 | |
Every so often on the podcast we will bring you something that is a little bit different. This episode is part one of a conversation with Esports Legends TLO & MaNa. They are professional Starcraft II players and they tell us the story of what it was like to compete against Google DeepMinds AlphaStar AI agent. This is a fascinating discussion about the technical capability of AI agents and about the psychology involved when Humans take on the machines. Episode Summary
Resources: Deepmind Alphastar Videos: https://deepmind.com/research/open-source/alphastar-resources TLO Profile: - https://liquipedia.net/starcraft2/TLO MaNa Profile: - https://liquipedia.net/starcraft2/MaNa | |||
28 Oct 2020 | Deep Fakes (Part 2) - Technological Advancements & Impact on Society | 00:20:24 | |
This is Part Two of our conversation about Deep Fakes with two experts in their respective fields. We talk to Dr Eileen Culloty of the Institute for Future Media and Journalism at Dublin City University and Dr Stephane Lathuiliere of Telecom Paris. EPISODE SUMMARY:
Resources: Video of First Order Motion Model For Video Animation: https://www.youtube.com/watch?v=u-0cQ-grXBQ&ab_channel=AliaksandrSiarohin PROVENANCE program: https://fujomedia.eu/provenance/ | |||
09 May 2023 | Data Strategy Evolved: How the Biological Model fuels enterprise data performance | 00:56:37 | |
In this episode Patrick McQuillan shares his innovative Biological Model - a concept you can use to enhance data outcome in large enterprises. The concept takes the idea that the best way to design a data strategy is to align it closely with a biological system. He discusses the power of centralized information, importance of data governance, and the necessity for a common performance narrative across an organization. Episode Summary - - Biological Model Concept - Centralized vs. Decentralized Data - Data Collection and Maturity - Horizontal translation layer - Partnership with vertical leaders - Curated data layers - Data dictionary for consistency - Focusing on vital metrics - Data Flow in Organizations - Biological Model Governance - Overcoming Inconsistency and Inaccuracy | |||
26 Nov 2020 | AI V Humans (Part 2) - Esports Legends Battle With AlphaStar (Google DeepMind) | 00:32:02 | |
Every so often on the podcast we will bring you something a little bit different. This episode is part two of our conversation with Esports Legends TLO & MaNa. They are professional Starcraft II players and they tell us the story of what it was like to compete against Google DeepMinds AlphaStar AI agent. This is a fascinating discussion about the technical capability of AI agents and about the psychology involved when Humans take on the machines. Episode Summary
Resources: Deepmind Alphastar Videos: https://deepmind.com/research/open-source/alphastar-resources TLO Profile: - https://liquipedia.net/starcraft2/TLO MaNa Profile: - https://liquipedia.net/starcraft2/MaNa | |||
14 Oct 2020 | Philipp Koehn (Part 2) - How Neural Networks have Transformed Machine Translation | 00:29:41 | |
This is Part 2 of our conversation with Professor Philipp Koehn of Johns Hopkins University. Professor Koehn is one of the world’s leading experts in the field of Machine Translation & NLP. In this episode we delve into commercial applications of machine translation, open source tools available and also take a look into what to expect in the field in the future. Episode Summary:
Resources:
Philipp Koehn latest book - Neural Machine Translation - Amazon link:
https://www.amazon.com/Neural-Machine-Translation-Philipp-Koehn/dp/1108497322
Omniscien Technologies - Leading Enterprise Provider of machine translation services:
Open Source tools:
- Fairseq https://fairseq.readthedocs.io/en/latest/ - Marian https://marian-nmt.github.io/ - OpenNMT https://opennmt.net/ - Sockeye https://awslabs.github.io/sockeye/
Translated texts (parallel data) for training:
- OPUS http://opus.nlpl.eu/ - Paracrawl https://paracrawl.eu/
Two papers mentioned about excessive use of computing power to train NLP models:
- GPT-3 https://arxiv.org/abs/2005.14165 - Roberta https://arxiv.org/abs/1907.11692 |