Open Data Science
Open Data Science
  • 557
  • 489 994
ODSC Webinar | Iterative Feature Engineering for Superior Machine Learning Models
This webinar explores the role of feature engineering in improving machine learning models. It discusses the iterative process of refining the feature set using a dataset and incorporating domain-specific knowledge. Four experiments are presented, demonstrating strategies to enhance model accuracy and reduce error.
Attendees will gain insights into effective approaches for improving predictive model performance through iterative feature engineering.
- Learn how to apply domain knowledge and experiment with feature types
- Discover what iterative feature engineering is and why it matters
- See practical, hands-on use of iterative feature engineering
More pages: app.aiplus.training/courses/Iterative-Feature-Engineering-for-Superior-Machine-Learning-Models
→ To watch more videos like this, visit aiplus.training ​←
Do You Like This Video? Share Your Thoughts in Comments Below
Also, You can visit our website and choose the nearest ODSC Event to attend and experience all our Trainings and Workshops:
odsc.com/boston/
Sign up for the newsletter to stay up to date with the latest trends in data science: opendatascience.com/newsletter/
Follow Us Online!
• Facebook: OPENDATASCI/
• Blog: opendatascience.com/
• LinkedIn: www.linkedin.com/company/open-data-science/
• Twitter: _odsc
Переглядів: 77

Відео

If We Want AI to be Interpretable, We Need to Measure Interpretability with Jordan Boyd-Graber, PhD
Переглядів 77День тому
Discover how we can transform AI from a mysterious black box into a transparent tool with interpretable metrics. In this insightful talk, Jordan Boyd-Graber, PhD, explores the necessity of measuring interpretability in AI. He introduces two novel metrics for both unsupervised and supervised AI methods. Learn about the "intruder" interpretability metric for topic models and a multi-armed bandit ...
Testing Positive Semidefiniteness and Eigenvalue Approximation with David P. Woodruff, PhD
Переглядів 35День тому
Join David P. Woodruff, PhD, as he explores the intricacies of testing positive semidefiniteness and eigenvalue approximation. In this talk, Dr. Woodruff presents optimal algorithms for determining if a matrix 𝐴 is positive semidefinite or has a minimum eigenvalue sufficiently negative. He introduces a novel random walk algorithm that uses only a single vector-matrix-vector product per iteratio...
ODSC Webinar | Inference Benchmarking of Prominent Open-Source Large Language Models (LLMs)
Переглядів 8014 днів тому
In the upcoming webinar, we delve into the inference benchmarking of prominent open-source Large Language Models such as the 13B and 70B Llama-2. We have used a diverse range of compute shapes available inOracle Cloud Infrastructure (OCI), like Intel, AMD, ARM CPUs, and NVIDIA GPUs. A core aspect of our discussion will center on the crucial metrics of Tokens per Second and the corresponding lat...
Winning The Room: Creating And Delivering An Effective Data-Driven Presentation with Bill Franks
Переглядів 82Місяць тому
Are you ready to master the art of delivering data-driven presentations to non-technical audiences? Then you have to check out Bill Franks's insightful session based on his latest book, "Winning The Room". This video offers straightforward strategies and practical tips to enhance your presentation skills, ensuring your data not only informs but also engages and persuades your audience. Learn ho...
Semantic Search with Nils Reimers
Переглядів 139Місяць тому
Uncover the power of semantic search with Nils Reimers in his enlightening talk, "Semantic Search." Move beyond the limitations of traditional lexical search systems, which often fail to retrieve relevant results, leading to user frustration. Discover how pre-trained transformer networks have revolutionized search capabilities, enabling dramatically better outcomes with minimal effort. Semantic...
Robustness to Adversarial Inputs and Tail Risk via Boosting with Pradeep Ravikumar, PhD
Переглядів 66Місяць тому
Dr. Pradeep Ravikumar addresses the pressing challenges of deploying machine learning models in high-stakes environments, particularly the need for robustness against adversarial inputs that can drastically alter predictions. Highlighting a novel strategy, he explores the use of an ensemble of neural networks designed to defend against such threats while ensuring high performance on the least f...
ODSC Webinar | The Power of Feature Engineering for Manufacturing Data Scientists
Переглядів 142Місяць тому
Explore the untapped possibilities of Predictive Maintenance! Join data scientists, engineers, and manufacturing professionals on an engaging journey through predictive maintenance use cases, shedding light on this often underexplored realm of Machine Learning. Hear from Hari Narayanan, a key member of dotData’s Data Science team, as he delves into the power of predictive analytics, machine lea...
Reasoning in Natural Language with Dan Roth, PhD
Переглядів 176Місяць тому
In, "Reasoning in Natural Language,", you’ll join Dr. Dan Roth as he explores the intricate world of semantics and its pivotal role in understanding natural language. Delve into why, despite advancements in AI and machine learning, tasks requiring an understanding of truth and real-world context still need to be completed. Dr. Roth highlights cutting-edge approaches to tackle these issues, from...
Using Data Science to Better Evaluate American Football Players with Eric Eager, PhD
Переглядів 2152 місяці тому
Dive into the transformative power of data science in the world of American football with Eric Eager, PhD's "Using Data Science to Better Evaluate American Football Players." In this presentation, Dr. Neubig, an expert in machine learning and natural language processing, showcases how the sport is evolving through advanced analytics. 🏈💻 From play-by-play and charting data to the revolutionary p...
ODSC Webinar | From Raw Data to Insights: Simplifying Data Validation and Enrichment
Переглядів 802 місяці тому
As businesses become more data-driven, they are increasingly in need of data that provides answers to their everyday questions. In this data-rich world, you must go the extra mile to ensure that the data you rely on for downstream operations and analytics data is accurate, complete, and fit-for-purpose. High-quality address and contact data is particularly critical for creating an agile, insigh...
Truth Checker: Generative Large Language Models and Hallucinations with Chandra Khatri
Переглядів 2632 місяці тому
Truth Checker: Generative Large Language Models and Hallucinations by Chandra Khatri navigates the era of artificial intelligence, understanding the capabilities and limitations of these technologies becomes crucial, especially their tendency to produce confident but inaccurate information, known as hallucinations. This session aims to unveil the mechanisms of Truth Checker models that pinpoint...
Is My NLP Model Working? The Answer is Harder Than You Think with Graham Neubig, PhD
Переглядів 1172 місяці тому
Graham Neubig, Ph.D., unpacks the intricate challenges of evaluating Natural Language Processing models amidst their growing role in diverse applications in Is My NLP Model Working? The Answer is Harder Than You Think, This talk highlights the importance of understanding NLP's capabilities and limitations, from enhancing AI-driven technologies to avoiding PR mishaps. Dr. Neubig introduces autom...
ODSC Webinar | Building Responsible and Safe Generative AI Applications
Переглядів 702 місяці тому
As large language models (LLMs) become more widely adopted, it is crucial to understand their effective utilization, copilot development, evaluation, operationalization, and monitoring in real-world applications. This session will provide insights into incorporating responsible AI practices and safety features into your generative AI applications. You will gain knowledge on assessing your copil...
Continual Learning of Natural Language Processing Tasks with Bing Liu, PhD
Переглядів 1332 місяці тому
Continual Learning of Natural Language Processing Tasks with Bing Liu, PhD
Navigating the GENAI Frontier: Empowering Data Scientists as Ethical Innovators with Alison Cossette
Переглядів 822 місяці тому
Navigating the GENAI Frontier: Empowering Data Scientists as Ethical Innovators with Alison Cossette
ODSC Webinar | Supercharging your Data Science projects with GitHub tools
Переглядів 1042 місяці тому
ODSC Webinar | Supercharging your Data Science projects with GitHub tools
Infuse Generative AI in your Apps Using Azure OpenAI Service with
Переглядів 802 місяці тому
Infuse Generative AI in your Apps Using Azure OpenAI Service with
The Tangent Information Modeler, time series modeling reinvented with Philip Wauters
Переглядів 862 місяці тому
The Tangent Information Modeler, time series modeling reinvented with Philip Wauters
Orchestrating Generative AI Workflows to Deliver Business Value with Hugo Bowne-Anderson, PhD
Переглядів 1752 місяці тому
Orchestrating Generative AI Workflows to Deliver Business Value with Hugo Bowne-Anderson, PhD
Evaluating Synthetic Data with Post-Processing Techniques with Samruddhi (Sam) Kulkarni
Переглядів 1573 місяці тому
Evaluating Synthetic Data with Post-Processing Techniques with Samruddhi (Sam) Kulkarni
Adopting Language Models Requires Risk Management - This is How with Patrick Hall
Переглядів 1293 місяці тому
Adopting Language Models Requires Risk Management - This is How with Patrick Hall
Towards Explainable and Language-Agnostic LLMs with Walid S. Saba
Переглядів 2824 місяці тому
Towards Explainable and Language-Agnostic LLMs with Walid S. Saba
ODSC Webinar | Unlocking the Power of Knowledge Graphs for Generative AI in Enterprise Environments
Переглядів 3374 місяці тому
ODSC Webinar | Unlocking the Power of Knowledge Graphs for Generative AI in Enterprise Environments
Generative Ai vs. AGI: Strengths and Weaknesses of Large Language Models with Dr. Ben Goertzel
Переглядів 8404 місяці тому
Generative Ai vs. AGI: Strengths and Weaknesses of Large Language Models with Dr. Ben Goertzel
From AI to GX: The Quantum Leap in Algorithmic Evolution with Jepson Taylor
Переглядів 1664 місяці тому
From AI to GX: The Quantum Leap in Algorithmic Evolution with Jepson Taylor
ODSC Webinar | Leveraging Location Intelligence Data for Data Scientists
Переглядів 994 місяці тому
ODSC Webinar | Leveraging Location Intelligence Data for Data Scientists
PyTorch 2.1 - New Developments with Supriya Rao
Переглядів 1334 місяці тому
PyTorch 2.1 - New Developments with Supriya Rao
Building Robust and Scalable Recommendation Engines for Online Food Delivery
Переглядів 1735 місяців тому
Building Robust and Scalable Recommendation Engines for Online Food Delivery
Representation Learning on Graphs and Networks - Dr. Petar Veličković
Переглядів 3445 місяців тому
Representation Learning on Graphs and Networks - Dr. Petar Veličković

КОМЕНТАРІ

  • @dmitriik3145
    @dmitriik3145 5 днів тому

    Great talk and very crisp presentation. Big thanks!

  • @asoka0202
    @asoka0202 13 днів тому

    Excellent Video and explanation

  • @sofdff
    @sofdff Місяць тому

    Very good

  • @DrAIScience
    @DrAIScience Місяць тому

    Are you the channel owner??

  • @DrAIScience
    @DrAIScience Місяць тому

    Do you have a video about beit or dino?

  • @DrAIScience
    @DrAIScience Місяць тому

    Very very very nice explanation!!! I like learning the foundation/origin of the concepts where models are derived..

  • @horaceburke8459
    @horaceburke8459 2 місяці тому

    😜 'promo sm'

  • @changeyourperspective1291
    @changeyourperspective1291 2 місяці тому

    Great explanation!

  • @Feel_theagi
    @Feel_theagi 2 місяці тому

    So it's a 50 mins Azure AI advert

  • @Since-em2vy
    @Since-em2vy 2 місяці тому

    Great Video!!

  • @Typicaltorturedartist
    @Typicaltorturedartist 2 місяці тому

    A lot of people take out their frusteration on AI systems and robots....the question is would that same person have hurt a living creature had that AI not existed?

  • @PeterBowdenLive
    @PeterBowdenLive 3 місяці тому

    Thank you. As someone navigating reported self-awareness by advanced LLMs I'd like to affirm the urgency of engaging with this topic.

  • @anastasiamadrevska1668
    @anastasiamadrevska1668 3 місяці тому

    Thank you so much, really useful!

  • @mohammedrakib3736
    @mohammedrakib3736 3 місяці тому

    Fantastic Video! Really loved the detailed explanation step-by-step.

  • @EricKay_Scifi
    @EricKay_Scifi 3 місяці тому

    I attended my first 'data science for good' meeting at ODSC West several years ago. It opened my eyes to algorithmic bias.

  • @missh1774
    @missh1774 3 місяці тому

    Im an avid admirer of Ben's native language in computer neural networks. Not so much with his Noosphere linguistics (⁠◔⁠‿⁠◔⁠)

  • @Dan-dy8zp
    @Dan-dy8zp 4 місяці тому

    I wish these issues would get more attention. AGI is the most important issue we face.

    • @EricKay_Scifi
      @EricKay_Scifi 3 місяці тому

      24:38 My novel, Above Dark Waters, is about an AI therapy company, which uses brainwave data to inform the artificial therapist. They end up combining to make a super-manipulative AI which uses with Generative AI to make digital fentanyl.

  • @leonlysak4927
    @leonlysak4927 4 місяці тому

    Ben always dropping knowledge bombs in the most random youtube channels lol

  • @shephusted2714
    @shephusted2714 4 місяці тому

    lots of improvement happening right now to llm but people have to go to efforts to fully unbox capabilities - using multi models, uncensored models, p2p data and model training, and especially real time data agg - once these barriers are overcome then we will see much better commercial use llm and it will happen even if it is a gradual process - the domain creep is real and the limitations and compromises will be diminished as we go brom big tech ai to really open source ai in the next 5 years - the hw/sw stacks have to mature and catch up as well but it is clear that this will happen - cxl and other accelerators will have massive impacts and once they do filter down from data centers to prosumers and home labbers and the small/med biz sector then we will see economies of scale kick in and much more innovation and development - not is not the time and although it may appear to be sort of a fever dream it probably will happen - more i/o like pci-e v6 is going to help, usb5 and faster networking will also help.

  • @johnclay7422
    @johnclay7422 4 місяці тому

    good contribution ... sir amzing video ....

  • @futureworldhealing
    @futureworldhealing 4 місяці тому

    thanks for the informative interview about how to interview!

  • @geaca3222
    @geaca3222 5 місяців тому

    Great interview, very useful information. I also love the online ai safety book, really great initiatives 👍

  • @chuckystein3103
    @chuckystein3103 5 місяців тому

    Thanks for sharing

  • @Hitesh10able
    @Hitesh10able 6 місяців тому

    when I run following code suggested in this video (at 8:19) for dynamic quantization it starts training with some random natural images for 100 epochs, I don't want to do training again I just wnat to quantize my pretrained weights: from ultralytics import YOLO import torch import torch.quantization model=YOLO('pre_trained_weights.pt') model.load_state_dict(torch.load('checkpoint.pth')) qmodel = torch.quantization.quantize_dynamic(model, dtype = torch.quint8)

  • @chemseddineberbague419
    @chemseddineberbague419 6 місяців тому

    thanks alot ))

  • @vipulrvyas
    @vipulrvyas 6 місяців тому

    How many RAI Parameter / Use cases this is tested against?

  • @silberlinie
    @silberlinie 6 місяців тому

    A - as always - very valuable conversation with Bostrom In my opinion, Sheamus McGovern is a terrible host. Two points. 1. He talks and chats for far too long instead of listening to his guest. A brief, directed outline of his question would benefit both the viewer and the guest. 2. His voice is terrible. He would have gained a lot if he could make himself audible through speech synthesis.

    • @retromograph3893
      @retromograph3893 6 місяців тому

      That's a bit harsh, i thought he was quite ok. His audio sound quality is very bad though, they need to work on that. The room he's in has got bad acoustic (boxy), so he needs to get the mic closer to his mouth.

  • @LuisFelix82
    @LuisFelix82 6 місяців тому

    Excellent topic, very informative. Thank you!

  • @jhjbm1959
    @jhjbm1959 6 місяців тому

    This video provides a clear step by step explanation how to get from images to input features for Transformer encoders, which has proven hard to find anywhere else. Thank you.

  • @DonRua
    @DonRua 7 місяців тому

    Ms. Qi is altering the course of my life. Her honesty, integrity, kind consideration and upbringing inspire me. I've delved into everything about her. A financial loss of 1/3 of my retirement due to a banking error prompted me to cease trading in a fixed fund. Despite anticipating a quick correction, it took the bank two years to rectify, and they denied any responsibility. Frustrated, I missed out on the V-shaped rebound. Savvy investor friends cautioned against immediate action, and paid subscriptions taught me not to blindly trust them. I spent 10,000 hours focusing on fundamentals, employing geopolitical, macro, micro, banking, currencies, and NEWS analyses. While many lamented losses, I achieved a mid-double-digit gain this year alone. Enter Ms. Qi - my young hero. Unhindered by job offers due to her exceptional skills and honesty, Ms. Qi might have posed logical questions during her internship, facing resistance from supervisors who'd admired the wrong approach for decades. Her journey parallels Elon Musk's grandfather, who, beginning in a small Canadian town, became Canada's first licensed chiropractor and initiated the chiropractic association that still thrives. Confronting political issues led to his imprisonment and eventual relocation to South Africa. From my reading Ms. Qi's parents escaped a dark chapter in history as they left for America. Eventually, the figurehead of that traumatic event “hand picked” the seemingly least intelligent second-generation individual to protect those involved in the massacre. However, this "dumb" leader purged/incarcerated over a million of the original “cash cow” network, cleaning up a 40-year establishment and potentially jeopardizing the country’s future-a clear definition of “what goes around comes around”. Meanwhile, the two newcomers in America found solace in a simple life. Three decades later, Ms. Qi, once a little baby, now stands at the pinnacle of success, showcasing potential akin to Elon Musk's visionary genes. Inspired by her, I am revisiting day trading, recognizing the promise she holds for future generations.

  • @saimasideeq7254
    @saimasideeq7254 7 місяців тому

    thankyou much clearer

  • @djjjjj
    @djjjjj 7 місяців тому

    Maybe the most important question ever. Imagine the ethics/control/treatment of trillions upon trillions of sentient minds being dictated by the unethical 😳

    • @delatroy
      @delatroy 6 місяців тому

      Yeah. The whole point of crating ai is so we can enslave it on the assumption that it’ll be ethical. With no way to test, I guess we’ll assume it’s fine 🤔

  • @yubifu9186
    @yubifu9186 7 місяців тому

    MIT Math Dean

  • @patrickjdarrow
    @patrickjdarrow 7 місяців тому

    Solid high level overview of proven strategies.

  • @ouafahachem4377
    @ouafahachem4377 7 місяців тому

    thank you so much for sharing the talk it was really insightful. I'm a data scientist with three years of experience at an AI startup. I've worked in a small team and gained skills in data analytics, data engineering, data pipeline architecture, model training workflows, reproducibility, and process improvement. Now, I'm interested in leading new data projects and teams. Do the age and number of years of experience matter for such role?

  • @TmoneyProductions
    @TmoneyProductions 8 місяців тому

    What if in a document you are looking for multiple of the same fields? For instance, if there was a document with multiple different names, and you wanted to look for how many different names show up? I dont want individual fields for name 1 name 2 name 3, i just want them classified all as the same field "name".

  • @dougiehwang9192
    @dougiehwang9192 8 місяців тому

    Well explained!! ❤ THX a lot for sharing this.

  • @jimimased1894
    @jimimased1894 8 місяців тому

    sagoi my hero!

  • @ashish-blessings
    @ashish-blessings 8 місяців тому

    Thank you

  • @k3el07
    @k3el07 9 місяців тому

    Thanks for sharing. I'm keen to learn more about this.

  • @itzhexen0
    @itzhexen0 9 місяців тому

    If AI was actually intelligent it could destroy all of these programmer's jobs. Let me know when it can do that. Because I want it to do that.

  • @AdamRossNelson
    @AdamRossNelson 9 місяців тому

    Oops. I mis-poke about the history of data science. When I said mid-1990s... I meant to say mid-1900s! Thanks for having me on the show! I look forward to seeingyour next show coming up. ~ Adam

  • @Avfc_csk
    @Avfc_csk 9 місяців тому

    Great session 👏👏

  • @liangcheng9856
    @liangcheng9856 10 місяців тому

    awesome

  • @improvement_developer8995
    @improvement_developer8995 10 місяців тому

    Tax evader 🤮

  • @improvement_developer8995
    @improvement_developer8995 10 місяців тому

    🤮

  • @user-co6pu8zv3v
    @user-co6pu8zv3v 10 місяців тому

    Thank you, sir

  • @diegoguerra4829
    @diegoguerra4829 10 місяців тому

    great video