iHash

News and How to's

Solving Unstructured Data: NLP and Language Models as Part of the Enterprise AI Strategy 

Jun 1, 2022 by iHash

In this special guest feature, Prabhod Sunkara, Co-founder and COO of nRoad, Inc., discusses how enterprises are increasingly relying on unstructured data for analytic, regulatory, and corporate decision-making purposes. nRoad is a purpose-built natural-language processing (NLP) platform for unstructured data in the financial services sector and the first company to declare a “War on Documents.” Prior to nRoad, Prabhod held various leadership roles in product development, operations, and solution architecture. His passion for building and delivering outcome-driven AI solutions has successfully improved processes at large global financial firms such as Bank of America, Merrill Lynch, Morgan Stanley, and UBS.

Unstructured data, the deep, dark data that’s prevalent across the enterprise but not always transparent or usable, continues to be a top business challenge. Data that lacks a predefined data model is typically considered unstructured, including everything from text-heavy documents and websites to images, video files, chatbot conversations, audio streams, and social media posts. Collectively, by most estimates, these types of data account for 80 to 90 percent or more of the overall digital data universe.

Growth and Challenges of Unstructured Data

The volume of unstructured data is set to grow from 33 zettabytes in 2018 to 175 zettabytes, or 175 billion terabytes, by 2025, according to the latest figures from research firm IDC. Thankfully, there is an increased awareness of the explosion of unstructured data in enterprises. For example, a recent study showed that nearly 80 percent of financial services organizations are experiencing an influx of unstructured data. Furthermore, most of the participants in the same study indicated that 50 to 90 percent of their current data is unstructured.
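As a quick check on the figures above, the unit conversion and the growth they imply can be worked out directly. This is a minimal sketch assuming decimal (SI) units, where one zettabyte is 10^21 bytes and one terabyte is 10^12 bytes. The variable names are illustrative, and the annual growth rate is derived here from the two quoted figures, not taken from the cited research.

```python
# Assumes decimal (SI) units: 1 ZB = 10**21 bytes, 1 TB = 10**12 bytes.
BYTES_PER_ZB = 10**21
BYTES_PER_TB = 10**12

zb_2018 = 33   # zettabytes in 2018, as quoted in the article
zb_2025 = 175  # zettabytes projected for 2025, as quoted in the article

# Convert the 2025 projection from zettabytes to terabytes.
tb_2025 = zb_2025 * BYTES_PER_ZB // BYTES_PER_TB

# Overall growth multiple, and the compound annual growth rate it implies
# over the seven years from 2018 to 2025 (derived, not quoted from the source).
growth_multiple = zb_2025 / zb_2018
years = 2025 - 2018
implied_cagr = growth_multiple ** (1 / years) - 1

print(f"{tb_2025:,} TB")
print(f"{growth_multiple:.1f}x overall, about {implied_cagr:.0%} per year")
```

The conversion confirms that 175 zettabytes equals 175 billion terabytes, and the two figures together imply a roughly fivefold increase over the period.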

Until recently, it hasn’t been possible for computers to understand this data. Now, enterprises are increasingly relying on unstructured data for analytic, regulatory, and corporate decision-making purposes. As unstructured data becomes more valuable to the enterprise, technology and data teams are racing to upgrade their infrastructure to keep pace with the growth of cloud-based services and the sheer explosion of internal and external data.

At the same time, these teams are having active conversations around leveraging insights buried in unstructured data sources. The spectrum of use cases ranges from infusing operational efficiencies to proactively servicing the end customer. To that effect, CIOs and CDOs are actively evaluating or implementing solutions ranging from basic OCR Plus solutions to complex large language models coupled with machine or deep learning techniques.

Incorporating NLP and Language Models into Your Data Strategy

A considerable portion of the enterprise’s unstructured data is textual. This ranges from legal contracts and research documents to customer complaints submitted through chatbots, and everything in between. So naturally, organizations are adopting Natural Language Processing (NLP) as part of their AI and digitization strategy.

Over the past decade, there has been considerable research and advancement in NLP. Most notably, the emergence of transformer models is allowing enterprises to move beyond simple keyword-based text analytics to more advanced sentiment and semantic analysis. While NLP enables machines to quantify and understand text, resolving ambiguity remains a significant challenge at its core. One way to tackle ambiguity resolution is to incorporate domain knowledge and context into the respective language model(s). Leveraging fine-tuned models such as LegalBERT, SciBERT, and FinBERT provides a more streamlined starting point for specific use cases.
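The point about domain knowledge resolving ambiguity can be illustrated with a deliberately small toy example. The sketch below is not a transformer model and does not use LegalBERT, SciBERT, or FinBERT. It is a keyword-based scorer with hand-made, hypothetical lexicons, written only to show how the same sentence scores differently once financial context is taken into account. All names and word weights are assumptions made for illustration.

```python
import re

# Hypothetical general-purpose sentiment lexicon (illustrative weights only).
GENERAL_LEXICON = {
    "liability": -1, "liabilities": -1, "debt": -1, "loss": -1,
    "growth": 1, "gain": 1, "strong": 1,
}

# Hypothetical domain overrides: in financial filings, "liabilities" and
# "debt" are routine balance-sheet terms and carry no sentiment by themselves.
FINANCE_OVERRIDES = {"liability": 0, "liabilities": 0, "debt": 0}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def keyword_score(text, lexicon):
    """Sum the lexicon weight of every token; unknown tokens count as 0."""
    return sum(lexicon.get(token, 0) for token in tokenize(text))


def finance_score(text):
    """Score with the general lexicon after applying the finance overrides."""
    return keyword_score(text, {**GENERAL_LEXICON, **FINANCE_OVERRIDES})


sentence = ("Total liabilities were flat while debt was refinanced, "
            "supporting strong growth.")

general = keyword_score(sentence, GENERAL_LEXICON)
finance = finance_score(sentence)
print("general-purpose score:", general)
print("finance-aware score:", finance)
```

The general-purpose lexicon treats the balance-sheet vocabulary as negative and cancels out the positive terms, while the finance-aware version reads the sentence as positive. Domain-adapted language models address the same problem at a much larger scale by learning context from domain text instead of relying on hand-written overrides.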

At the outset, fine-tuned models establish a strong base. However, like larger models such as BERT and GPT-3, they still fall short of meeting most companies’ business outcome needs. As a result, enterprises operating in multiple markets, regions, and languages should consider incorporating cross-domain language models, multilingual models, and/or transfer learning techniques to accommodate broader challenges.

While research and development of larger and better language model architectures continues, there is no one-size-fits-all solution today. As a result, enterprises that try to build their own language models can also fall short of the organization’s objectives. Other factors affecting an organization’s unstructured data strategy include a lack of annotated data, the unavailability of training data, a lack of organizational understanding of how to adopt such models, and the simple need to quickly develop and deploy a production-grade solution at an affordable computational cost while realizing ROI.

How Enterprises Can Tackle Their Growing Unstructured Data Problem

Data and a technology strategy play a key role in a typical enterprise AI roadmap. Most organizations are able to plan and manage structured data effectively. However, unstructured data is where the real context and insights are buried, and organizations drown in this data. It behooves the CDO organization of an enterprise to take this data into account and intelligently plan to utilize this information.   

The biggest challenge often seen is the lack of organizational alignment around an enterprise’s AI strategy. While this isn’t directly related to ML and DL models, leadership alignment, a sound understanding of the data and outcomes, and a diverse team composition are critical for any AI strategy in an enterprise. A quantifiable, outcome-driven approach allows teams to focus on the end goal instead of hype-driven AI models. For example, GPT-3 is a heavy language prediction model that is often not highly accurate. There have been instances where GPT-3-based models have propagated misinformation, leading to public embarrassment for an organization’s brand.

Training and building deep learning solutions are often computationally expensive, and applications that need to apply NLP-driven techniques require computational and domain-rich resources. Hence, when starting an in-house AI team, organizations need to emphasize problem definition and measurable outcomes. In addition to problem definition, product teams must focus on data variability, complexity, and availability. These steps will help strategize an approach, identify the suitable models as a foundational layer, and establish a sound data governance and training function.

An alternative and cost-effective approach is choosing a third-party partner or vendor to help jump-start your strategy. Vendor-based technology allows enterprises to take advantage of the vendor’s best practices and implementation expertise in larger language models, along with the experience the vendor has gained from tackling other problem statements.

Incorporating a strategy to manage the enterprise unstructured data problem and leveraging NLP techniques are becoming critical components of an organization’s data and technology strategy. Although RPA, OCR Plus, or basic statistics-based ML models will not solve the complete problem, incorporating deep learning methods should be a path forward.

Sign up for the free insideBIGDATA newsletter.

Join us on Twitter: @InsideBigData1 – https://twitter.com/InsideBigData1
